NOVEL HUMAN PROTEINS, POLYNUCLEOTIDES ENCODING
THEM AND METHODS OF USING THE SAME

FIELD OF THE INVENTION 

The invention relates to polynucleotides and the polypeptides encoded by such 
polynucleotides, as well as vectors, host cells, antibodies and recombinant methods for 
producing the polypeptides and polynucleotides, as well as methods for using the same. 

BACKGROUND OF THE INVENTION 

The present invention is based in part on nucleic acids encoding proteins that are new
members of the following protein families: Calpain-like, Epsin-like, Low Density Lipoprotein
B-like, purinoceptor-like, CG8841-like, Synaptotagmin-like, Serine Protease TLSP-like,
Glypican-2 Precursor-like, Mitogen-activated protein kinase kinase-like, Zinc finger protein
276 C2H2 type protein and Thymosin beta10-like. More particularly, the invention relates to
nucleic acids encoding novel polypeptides, as well as vectors, host cells, antibodies, and
recombinant methods for producing these nucleic acids and polypeptides.

Calpains are intracellular cysteine proteases that are regulated by calcium. They are
known to be involved in a number of cellular processes, such as apoptosis, protein processing,
cell differentiation, metabolism etc. As such, their role in pathophysiologies extends to, but is
not restricted to, tissue remodeling and regeneration (in response to a variety of injury models
in the eye, brain, spinal cord, kidney etc.), fertility, tumorigenesis and myopathies. One of the
genes identified in susceptibility to type II diabetes is a calpain (calpain-10) (Horikawa et al.,
Nat Genet 26(2):163-75, 2000). Polymorphisms within this gene are correlated with insulin
resistance. Therapies targeting calpain are relevant to disease areas such as cataract, spinal
cord injury, Alzheimer's disease, muscular dystrophy, acoustic trauma, diabetes, cancer,
learning and memory defects and infertility. Knockout and transgenic models of various
calpains also point to a potential role for this family of proteases in a number of cellular and
disease processes.

Epsins are a family of proteins that bind to EH domain proteins such as Eps15.
They are involved in clathrin-mediated endocytosis as well as intracellular protein sorting.
Some members of this family undergo phosphorylation during mitosis. In addition, epsins are
involved in endocytosis at synapses to compensate for secretion of neurotransmitter-
containing vesicles. The interaction of epsin 1 with a transcription factor (promyelocytic
leukemia zinc finger protein) has recently been demonstrated, making it likely that the
endocytotic machinery can cross-talk with nuclear function. Perturbation of epsin function can
lead to defects in the endocytosis of membrane receptors as well as secreted proteins like
transferrin, with consequent side-effects. Defects in epsin may potentially lead to aberrant
cell-cell signalling, developmental defects, aberrant neurotransmitter signalling etc.

Low density lipoprotein (LDL) particles are the major cholesterol carriers in
circulation and their physiological function is to carry cholesterol to the cells. In the process of
atherogenesis these particles are modified and they accumulate in the arterial wall. Elevated
serum cholesterol bound to low density lipoprotein (LDL) is a characteristic of familial
hypercholesterolemia. Individuals with coronary artery disease have a significantly higher
mean lipoprotein concentration than those without coronary heart disease, suggesting that
lipoprotein measurements may help predict the risk of coronary heart disease in individuals
with familial hypercholesterolemia.

Many cells express plasma membrane receptors for extracellular molecules, termed
purinoceptors, which appear to be coupled to a plasma membrane pore. Purinoceptors are
primitive, widespread and serve many different systems. There are several subclasses of
purinoceptors: receptors for adenosine (P1-purinoceptors) and receptors for ATP (P2-
purinoceptors). As for other major transmitters such as acetylcholine, GABA, glutamate and 5-
HT, receptors of two major families are activated by ATP: one (the P2X-purinoceptor family)
mediates fast responses via ligand-gated ion channels, while the other (the P2Y-purinoceptor
family) mediates slower responses via G-proteins.

Synaptotagmins (Syts) are brain-specific Ca2+/phospholipid-binding proteins (Li et al.,
Nature 375(6532):594-9, 1995). In hippocampal synapses, Syt I is essential for fast Ca(2+)-
dependent synaptic vesicle exocytosis but not for Ca(2+)-independent exocytosis. In
vertebrates and invertebrates, Syt may therefore participate in Ca(2+)-dependent synaptic
membrane fusion, either by serving as the Ca2+ sensor in the last step of fast Ca(2+)-triggered
neurotransmitter release, or by collaborating with an additional Ca2+ sensor. While Syt I binds
Ca2+ (refs 10, 11), its phospholipid binding is triggered at lower calcium concentrations
(EC50 = 3-6 microM) than those required for exocytosis. Furthermore, Syts bind clathrin-AP2
with high affinity, indicating that they may play a general role in endocytosis rather than being
confined to a specialized function in regulated exocytosis. Here we resolve this apparent
contradiction by describing four Syts, three of which (Syt VI, VII and VIII) are widely
expressed in non-neural tissues. All Syts tested share a common domain structure, with a
cytoplasmic region composed of two C2 domains that interacts with clathrin-AP2 (Kd = 0.1-
1.0 nM) and with neural and non-neural syntaxins. The first C2 domains of Syt I, II, III, V and
VII, but not of IV, VI or VIII, bind phospholipids with a similar Ca(2+)-concentration
dependence (EC50 = 3-6 microM). The same C2 domains also bind syntaxin as a function of
Ca2+ but the Ca(2+)-concentration dependence of Syt I, II and V (> 200 microM) differs from
that of Syt III and VII (< 10 microM).

Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being
found in viruses, bacteria and eukaryotes. They include a wide range of peptidase activity,
including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20
families (denoted S1 - S27) of serine protease have been identified, these being grouped into 6
clans (SA, SB, SC, SE, SF and SG) on the basis of structural similarity and other functional
evidence. Structures are known for four of the clans (SA, SB, SC and SE): these appear to be
totally unrelated, suggesting at least four evolutionary origins of serine peptidases and possibly
many more. Notwithstanding their different evolutionary origins, there are similarities in the
reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C
clans have a catalytic triad of serine, aspartate and histidine in common: serine acts as a
nucleophile, aspartate as an electrophile, and histidine as a base. The geometric orientations of
the catalytic residues are similar between families, despite different protein folds. The linear
arrangements of the catalytic residues commonly reflect clan relationships. For example, the
catalytic triad in the chymotrypsin clan (SA) is ordered HDS, but is ordered DHS in the
subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC).
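
The clan-specific ordering of the catalytic triad described above can be sketched as a small lookup. This is only an illustration: the function names and the residue positions in the usage note are assumptions introduced here, not part of the disclosure, and linear order alone would not be sufficient to assign a real protein to a clan.

```python
# Linear order of the catalytic triad residues for three clans,
# taken from the paragraph above (H = histidine, D = aspartate, S = serine).
TRIAD_ORDER_BY_CLAN = {"SA": "HDS", "SB": "DHS", "SC": "SDH"}


def triad_order(positions):
    """Return the triad letters sorted by their position in the sequence.

    `positions` maps each residue letter ('H', 'D', 'S') to its
    1-based position in the polypeptide (hypothetical input format).
    """
    if set(positions) != {"H", "D", "S"}:
        raise ValueError("positions must contain exactly H, D and S")
    return "".join(sorted(positions, key=positions.get))


def clans_with_same_order(positions):
    """List the clans whose triad order matches the given positions."""
    order = triad_order(positions)
    return [clan for clan, o in TRIAD_ORDER_BY_CLAN.items() if o == order]
```

For instance, with the commonly cited chymotrypsin numbering (His57, Asp102, Ser195; supplied here for illustration), `clans_with_same_order({"H": 57, "D": 102, "S": 195})` yields the SA clan.
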

Glypicans are a family of heparan sulfate proteoglycans that are anchored to the
plasma membrane via a glycosylphosphatidylinositol modification. The six glypican genes
identified so far show distinct developmental and tissue expression patterns in mice. Glypicans
could potentially also be secreted away from the membrane by proteolysis and the soluble
protein could potentially act as a dominant-negative inhibitor of the intact protein. This family
of proteins has been implicated in neuronal development, guidance and regeneration. It may
thus have a role in synaptic plasticity. One of the glypican genes in Drosophila is involved in
the wingless and decapentaplegic signaling pathways. Deficiencies in glypican-3 in mice lead
to a congenital overgrowth syndrome. In humans, deletions and translocations involving the
glypican-3 gene have been associated with an X-linked recessive gigantism syndrome. In
addition, the expression of this protein is silenced in an in vitro model of malignant
mesothelioma. The novel protein, therefore, may play a role in tissue morphogenesis and
patterning, cell division and cell signaling.

Mitogen-activated protein kinase kinase (MAPKK) is a dual-specificity protein kinase
which phosphorylates and activates mitogen-activated protein kinase (MAPK). cDNAs
encoding two isoforms of MAPKK, MAPKK1 and MAPKK2 (also known as MEK1 and
MEK2), have been cloned in mammalian cells (Moriguchi et al., Eur J Biochem 234(1):32-8,
1995). Mitogen-activated protein kinase kinase 1 (MAPKK1) and MAPKK2 function
downstream of the proto-oncogene product Raf in signaling pathways that affect cell
proliferation and differentiation. The isoforms have been shown to be differentially regulated
in two significant ways: MAPKK1, but not MAPKK2, was phosphorylated and inactivated by
the cyclin-dependent kinase p34cdc2; and p21 Ras formed a ternary complex with
Raf/MAPKK1 but not with Raf/MAPKK2 (Mansour et al., Cell Growth Differ 7(2):243-50,
1996). In a study of mouse tissues, MAPKK1 was shown to be highly enriched in the brain
while MAPKK2 is present relatively evenly. Both isoforms were shown to reside in the
cytoplasm and both are activated in response to nerve growth factor (NGF) and epidermal
growth factor (EGF) (Moriguchi et al., Eur J Biochem 234(1):32-8, 1995).

A startling number of cDNA clones encode proteins that contain one or more
sequences that match the zinc finger consensus domain, revealing that zinc finger proteins
represent perhaps the largest class of DNA binding proteins in eukaryotes and that zinc finger
protein-controlled gene expression may be a fundamental aspect of development as well as
other processes. Structurally distinct clusters of zinc finger modules define an extremely large
superfamily of nucleic acid binding proteins with several hundred, perhaps thousands of
different members in vertebrates. C2H2 type zinc finger proteins (ZFPs) are one of the most
complex members of zinc finger modules (Pieler et al., Mol Biol Rep 20(1):1-8, 1994 and
Berg et al., Annu Rev Biophys Biophys Chem 19:405-21, 1990).

The beta-thymosins comprise a family of structurally related, highly conserved acidic
polypeptides, originally isolated from calf thymus. A number of peptides belong to this family.
They include thymosin beta-4, a small polypeptide that was first isolated as a thymic
hormone and induces terminal deoxynucleotidyltransferase; thymosin beta-9 (and beta-8) in
bovine and pig; thymosin beta-10 in man and rat; thymosin beta-11 and beta-12 in trout; and
human Nb thymosin beta. They are found in high quantity in thymus and spleen but are also
widely distributed in many tissues. They have been shown to bind to actin monomers and thus
to inhibit actin polymerization.

Thymosin beta-10 is a small conserved acidic protein involved in the inhibition of actin
polymerization. Studies have demonstrated that thymosin beta-10 expression is regulated by
extracellular signals that stimulate growth of thyroid cells both in vitro and in vivo, and
suggest a role for this protein in thyroid diseases characterized by proliferation of follicular
cells (10366416). Other studies have demonstrated that thymosin beta-10 is overexpressed in
rat thyroid transformed cell lines and in human thyroid carcinoma tissues and cell lines. This
evidence suggests that thymosin beta-10 detection may be considered a potential tool for the
diagnosis of several human neoplasias (10487837).



SUMMARY OF THE INVENTION

The invention is based in part upon the discovery of nucleic acid sequences encoding
novel polypeptides. The novel nucleic acids and polypeptides are referred to herein as NOVX,
or NOV1, NOV2, NOV3, NOV4, NOV5, NOV6, NOV7, NOV8, NOV9, NOV10 and NOV11
nucleic acids and polypeptides. These nucleic acids and polypeptides, as well as derivatives,
homologs, analogs and fragments thereof, will hereinafter be collectively designated as
"NOVX" nucleic acid or polypeptide sequences.

In one aspect, the invention provides an isolated NOVX nucleic acid molecule
encoding a NOVX polypeptide that includes a nucleic acid sequence that has identity to the
nucleic acids disclosed in SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31
and 33. In some embodiments, the NOVX nucleic acid molecule will hybridize under
stringent conditions to a nucleic acid sequence complementary to a nucleic acid molecule that
includes a protein-coding sequence of a NOVX nucleic acid sequence. The invention also
includes an isolated nucleic acid that encodes a NOVX polypeptide, or a fragment, homolog,
analog or derivative thereof. For example, the nucleic acid can encode a polypeptide at least
80% identical to a polypeptide comprising the amino acid sequences of SEQ ID NOS:2, 4, 6,
8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and 34. The nucleic acid can be, for example, a
genomic DNA fragment or a cDNA molecule that includes the nucleic acid sequence of any of
SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and 33.
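
By way of illustration only, the "at least 80% identical" criterion above can be sketched for two sequences that have already been aligned. This passage does not specify how identity is to be computed, so the definition used here (identical positions divided by aligned columns, with "-" as the gap character) and the function names are assumptions, not the method of the invention.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumed definition: identical, non-gap columns divided by all columns
    in which at least one sequence has a residue. '-' marks a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == "-" and y == "-")  # ignore columns that are all gap
    ]
    if not columns:
        raise ValueError("no aligned positions to compare")
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 80.0) -> bool:
    """True when the two aligned sequences are at least `threshold` % identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

A real comparison would first align the candidate against the relevant SEQ ID NO with an alignment program; the short strings used to exercise this sketch are toy inputs, not disclosed sequences.
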

Also included in the invention is an oligonucleotide, e.g., an oligonucleotide which
includes at least 6 contiguous nucleotides of a NOVX nucleic acid (e.g., SEQ ID NOS:1, 3, 5,
7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and 33) or a complement of said oligonucleotide.
Also included in the invention are substantially purified NOVX polypeptides (SEQ ID NOS:2,
4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and 34). In certain embodiments, the
NOVX polypeptides include an amino acid sequence that is substantially identical to the
amino acid sequence of a human NOVX polypeptide.
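
The "at least 6 contiguous nucleotides ... or a complement" condition above can be sketched as a simple substring test. This is a minimal illustration under stated assumptions: sequences are plain DNA strings over A, C, G and T, "complement" is treated as the reverse complement, and all function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string (A<->T, C<->G)."""
    return seq.translate(_COMPLEMENT)[::-1]


def contains_contiguous_run(oligo: str, target: str, k: int = 6) -> bool:
    """True if `oligo` includes at least `k` contiguous nucleotides of `target`."""
    oligo, target = oligo.upper(), target.upper()
    # Every length-k window of the target sequence.
    target_kmers = {target[i:i + k] for i in range(len(target) - k + 1)}
    return any(oligo[i:i + k] in target_kmers
               for i in range(len(oligo) - k + 1))


def oligo_or_complement_matches(oligo: str, target: str, k: int = 6) -> bool:
    """Check the oligonucleotide itself and its reverse complement."""
    return (contains_contiguous_run(oligo, target, k)
            or contains_contiguous_run(reverse_complement(oligo), target, k))
```

The default `k = 6` mirrors the minimum length stated in the text; the target passed in would be one of the NOVX nucleic acid sequences, which are not reproduced here.
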

The invention also features antibodies that immunoselectively bind to NOVX
polypeptides, or fragments, homologs, analogs or derivatives thereof.

In another aspect, the invention includes pharmaceutical compositions that include
therapeutically- or prophylactically-effective amounts of a therapeutic and a pharmaceutically-
acceptable carrier. The therapeutic can be, e.g., a NOVX nucleic acid, a NOVX polypeptide,
or an antibody specific for a NOVX polypeptide. In a further aspect, the invention includes, in
one or more containers, a therapeutically- or prophylactically-effective amount of this
pharmaceutical composition.

In a further aspect, the invention includes a method of producing a polypeptide by
culturing a cell that includes a NOVX nucleic acid, under conditions allowing for expression
of the NOVX polypeptide encoded by the DNA. If desired, the NOVX polypeptide can then
be recovered.

In another aspect, the invention includes a method of detecting the presence of a
NOVX polypeptide in a sample. In the method, a sample is contacted with a compound that
selectively binds to the polypeptide under conditions allowing for formation of a complex
between the polypeptide and the compound. The complex is detected, if present, thereby
identifying the NOVX polypeptide within the sample.

The invention also includes methods to identify specific cell or tissue types based on
their expression of a NOVX.

Also included in the invention is a method of detecting the presence of a NOVX
nucleic acid molecule in a sample by contacting the sample with a NOVX nucleic acid probe
or primer, and detecting whether the nucleic acid probe or primer bound to a NOVX nucleic
acid molecule in the sample.

In a further aspect, the invention provides a method for modulating the activity of a
NOVX polypeptide by contacting a cell sample that includes the NOVX polypeptide with a
compound that binds to the NOVX polypeptide in an amount sufficient to modulate the
activity of said polypeptide. The compound can be, e.g., a small molecule, such as a nucleic
acid, peptide, polypeptide, peptidomimetic, carbohydrate, lipid or other organic (carbon
containing) or inorganic molecule, as further described herein.

Also within the scope of the invention is the use of a therapeutic in the manufacture of
a medicament for treating or preventing disorders or syndromes including, e.g., Von Hippel-
Lindau (VHL) syndrome, cirrhosis, transplantation disorders, pancreatitis, obesity, diabetes,
autoimmune disease, renal artery stenosis, interstitial nephritis, glomerulonephritis, polycystic
kidney disease, systemic lupus erythematosus, renal tubular acidosis, IgA nephropathy,
hypercalcemia, Lesch-Nyhan syndrome, developmental defects, cataract, spinal cord injury,
Alzheimer's disease, muscular dystrophy, acoustic trauma, cancer, learning and memory
defects, infertility, cardiomyopathies, atherosclerosis, hypertension, congenital heart defects,
aortic stenosis, atrial septal defect, atrioventricular canal defect, ductus arteriosus, pulmonary
stenosis, subaortic stenosis, ventricular septal defect, valve diseases, tuberous sclerosis,
scleroderma, endometriosis, hemophilia, hypercoagulation, idiopathic thrombocytopenic
purpura, immunodeficiencies, graft versus host disease, dementia, stroke, Parkinson's disease,
Huntington's disease, cerebral palsy, epilepsy, multiple sclerosis, ataxia-telangiectasia,
leukodystrophies, behavioral disorders, addiction, anxiety, pain, neurodegeneration, Familial
hypercholesterolemia, hyperlipoproteinemia II phenotype, tendinous xanthomas, corneal
arcus, coronary artery disease, planar xanthomas, webbed digits, hypercholesterolemia,
fertility, xanthomatosis, Hepatitis C infection, regulation, synthesis, transport, recycling, or
turnover of LDL receptors, Cerebral arteriopathy with subcortical infarcts and
leukoencephalopathy, Epiphyseal dysplasia, multiple 1, Ichthyosis, nonlamellar and
nonerythrodermic, congenital, Leukemia, T-cell acute lymphoblastoid, Pseudoachondroplasia,
SCID, autosomal recessive, T-negative/B-positive type, C3 deficiency, Diabetes mellitus,
insulin-resistant, with acanthosis nigricans, Glutaricaciduria, type I, Hypothyroidism,
congenital, Leprechaunism, Liposarcoma, Mucolipidosis IV, Persistent Mullerian duct
syndrome, type I, Rabson-Mendenhall syndrome, Thyroid carcinoma, nonmedullary, with cell
oxyphilia, Erythrocytosis, familial, Malaria, cerebral, susceptibility to, Bleeding disorder due
to defective thromboxane A2 receptor, Cerebellar ataxia, Cayman type, Convulsions, familial
febrile, 2, Cyclic hematopoiesis, Fucosyltransferase-6 deficiency, GAMT deficiency,
Cirrhosis, Psoriasis, Actinic keratosis, Tuberous sclerosis, Acne, Hair growth, alopecia,
pigmentation disorders, endocrine disorders, trauma, immunological disease, respiratory
disease, gastro-intestinal diseases, reproductive health, neurological diseases, bone marrow
transplantation, metabolic and endocrine diseases, allergy and inflammation, nephrological
disorders, hematopoietic disorders, urinary system disorders, Atopy, Osteoporosis-
pseudoglioma syndrome; Smith-Lemli-Opitz syndrome, type I; Smith-Lemli-Opitz syndrome,
type II; Xeroderma pigmentosum, group E, subtype 2; Asthma, atopic, susceptibility to;
Diabetes mellitus, insulin-dependent, 4; Susceptibility to IDDM; Angioedema, hereditary,
Paraganglioma, familial nonchromaffin, 2; neuroprotection; Lambert-Eaton myasthenic
syndrome, digestive system disorders, all or some of the protease/protease inhibitor deficiency
disorders, diabetes mellitus non-insulin dependent, Acyl-CoA dehydrogenase, deficiency of
long chain, Brachydactyly, type A1, Carbamoylphosphate synthetase I deficiency,
Cardiomyopathy dilated II, Cataract Coppock-like, Cataract crystalline aculeiform, Cataract
polymorphic congenital, Cataract variable zonular pulverulent, Cataracts punctate progressive
juvenile-onset, Choreoathetosis familial paroxysmal, Craniofacial-deafness-hand syndrome,
Ichthyosis lamellar, type 2, Myopathy, desmin-related cardioskeletal, Resistance/susceptibility
to TB, Rhabdomyosarcoma alveolar, Waardenburg syndrome type I and type III, Alport
syndrome autosomal recessive, Bjornstad syndrome, Hematuria, familial benign,
Hyperoxaluria primary, type 1, Syndactyly type 1, Hyperproglucagonemia, Bethlem
myopathy, Brachydactyly type E, Brachydactyly-mental retardation syndrome, Finnish lethal
neonatal metabolic syndrome, susceptibility to 2, Simpson-Golabi-Behmel syndrome, type I
and type 2, Beckwith-Wiedemann syndrome, pathogen infections, heart disease, prostate
cancer, angiogenesis and wound healing, modulation of apoptosis, neuropsychiatric disorders,
age-related disorders, pathological disorders involving spleen, thymus, lung, and peritoneal
macrophages and/or other pathologies and disorders of the like.

The therapeutic can be, e.g., a NOVX nucleic acid, a NOVX polypeptide, or a NOVX-
specific antibody, or biologically-active derivatives or fragments thereof.

For example, the compositions of the present invention will have efficacy for treatment
of patients suffering from the diseases and disorders disclosed above and/or other pathologies
and disorders of the like. The polypeptides can be used as immunogens to produce antibodies
specific for the invention, and as vaccines. They can also be used to screen for potential
agonist and antagonist compounds. For example, a cDNA encoding NOVX may be useful in
gene therapy, and NOVX may be useful when administered to a subject in need thereof. By
way of non-limiting example, the compositions of the present invention will have efficacy for
treatment of patients suffering from the diseases and disorders disclosed above and/or other
pathologies and disorders of the like.

The invention further includes a method for screening for a modulator of disorders or
syndromes including, e.g., the diseases and disorders disclosed above and/or other pathologies
and disorders of the like. The method includes contacting a test compound with a NOVX
polypeptide and determining if the test compound binds to said NOVX polypeptide. Binding
of the test compound to the NOVX polypeptide indicates the test compound is a modulator of
activity, or of latency or predisposition to the aforementioned disorders or syndromes.

Also within the scope of the invention is a method for screening for a modulator of
activity, or of latency or predisposition to disorders or syndromes including, e.g., the diseases
and disorders disclosed above and/or other pathologies and disorders of the like by
administering a test compound to a test animal at increased risk for the aforementioned
disorders or syndromes. The test animal expresses a recombinant polypeptide encoded by a
NOVX nucleic acid. Expression or activity of NOVX polypeptide is then measured in the test
animal, as is expression or activity of the protein in a control animal which recombinantly-
expresses NOVX polypeptide and is not at increased risk for the disorder or syndrome. Next,
the expression of NOVX polypeptide in both the test animal and the control animal is
compared. A change in the activity of NOVX polypeptide in the test animal relative to the
control animal indicates the test compound is a modulator of latency of the disorder or
syndrome.

In yet another aspect, the invention includes a method for determining the presence of
or predisposition to a disease associated with altered levels of a NOVX polypeptide, a NOVX
nucleic acid, or both, in a subject (e.g., a human subject). The method includes measuring the
amount of the NOVX polypeptide in a test sample from the subject and comparing the amount
of the polypeptide in the test sample to the amount of the NOVX polypeptide present in a
control sample. An alteration in the level of the NOVX polypeptide in the test sample as
compared to the control sample indicates the presence of or predisposition to a disease in the
subject. Preferably, the predisposition includes, e.g., the diseases and disorders disclosed
above and/or other pathologies and disorders of the like. Also, the expression levels of the new
polypeptides of the invention can be used in a method to screen for various cancers as well as
to determine the stage of cancers.

In a further aspect, the invention includes a method of treating or preventing a
pathological condition associated with a disorder in a mammal by administering a NOVX
polypeptide, a NOVX nucleic acid, or a NOVX-specific antibody to a subject (e.g., a
human subject), in an amount sufficient to alleviate or prevent the pathological condition. In
preferred embodiments, the disorder includes the diseases and disorders disclosed above
and/or other pathologies and disorders of the like.

In yet another aspect, the invention can be used in a method to identify the cellular
receptors and downstream effectors of the invention by any one of a number of techniques
commonly employed in the art. These include but are not limited to the two-hybrid system,
affinity purification, and co-precipitation with antibodies or other specific-interacting molecules.

NOVX nucleic acids and polypeptides are further useful in the generation of antibodies
that bind immuno-specifically to the novel NOVX substances for use in therapeutic or
diagnostic methods. These NOVX antibodies may be generated according to methods known
in the art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX
Antibodies" section below. The disclosed NOVX proteins have multiple hydrophilic regions,
each of which can be used as an immunogen. These NOVX proteins can be used in assay
systems for functional analysis of various human disorders, which will help in understanding
of pathology of the disease and development of new drug targets for various disorders.

The NOVX nucleic acids and proteins identified here may be useful in potential
therapeutic applications implicated in (but not limited to) various pathologies and disorders as
indicated below. The potential therapeutic applications for this invention include, but are not
limited to: protein therapeutic, small molecule drug target, antibody target (therapeutic,
diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene
therapy (gene delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of
all tissues and cell types composing (but not limited to) those defined here.

Unless otherwise defined, all technical and scientific terms used herein have the same
meaning as commonly understood by one of ordinary skill in the art to which this invention
belongs. Although methods and materials similar or equivalent to those described herein can
be used in the practice or testing of the present invention, suitable methods and materials are
described below. All publications, patent applications, patents, and other references
mentioned herein are incorporated by reference in their entirety. In the case of conflict, the
present specification, including definitions, will control. In addition, the materials, methods,
and examples are illustrative only and not intended to be limiting.

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides novel nucleotides and polypeptides encoded thereby.
Included in the invention are the novel nucleic acid sequences and their encoded polypeptides.
The sequences are collectively referred to herein as "NOVX nucleic acids" or "NOVX
polynucleotides" and the corresponding encoded polypeptides are referred to as "NOVX
polypeptides" or "NOVX proteins." Unless indicated otherwise, "NOVX" is meant to refer to
any of the novel sequences disclosed herein. Table A provides a summary of the NOVX
nucleic acids and their encoded polypeptides.

TABLE A. Sequences and Corresponding SEQ ID Numbers

NOVX        Internal Identification               SEQ ID NO        SEQ ID NO      Homology
Assignment                                        (nucleic acid)   (polypeptide)

1           3352274                               1                2              Calpain-like
2           21421174                              3                4              Epsin-like
3           AC025263_da1                          5                6              Low Density Lipoprotein B-like
4           AC026756_da1                          7                8              Purinoceptor
5a          sggc_draft_dj895c5_20000811_da1       9                10             CG8841-like
5b          CG54443-02                            11               12             CG8841-like
6a          SC134912642_da1                       13               14             Synaptotagmin-like
6b          CG50106-01                            15               16             Synaptotagmin-like
7           wugc_draft_[illegible]_20000809_da1   17               18             Serine Protease TLSP-like
8a          134913441_EXT                         19               20             Glypican-2 Precursor-like
8b          CG50970-02                            21               22             Glypican-2 Precursor-like
8c          CG50970-03                            23               24             Glypican-2 Precursor-like
8d          CG50970-04                            25               26             Glypican-2 Precursor-like
9           [illegible]                           27               28             Mitogen-activated protein kinase kinase 2-like
10          sggc_draft_c333e1_20000804_da2        29               30             Zinc Finger Protein 276 C2H2-type
11a         GMAC079400_A                          31               32             Thymosin beta 10-like
11b         CG109754-01                           33               34             Thymosin beta 10-like
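
The numbering convention used for the sequences (odd SEQ ID NOs 1 to 33 for nucleic acids, each followed by the even SEQ ID NO of its encoded polypeptide) can be captured in a few lines. This is a convenience sketch only; the function name is hypothetical and the pairing rule is read off the SEQ ID lists recited in the Summary.

```python
# Nucleic acid sequences carry the odd SEQ ID NOs 1, 3, ..., 33.
NUCLEIC_SEQ_IDS = range(1, 34, 2)


def polypeptide_seq_id(nucleic_seq_id: int) -> int:
    """Return the SEQ ID NO of the polypeptide encoded by a nucleic acid.

    Assumes each nucleic acid SEQ ID NO n (odd) is paired with
    polypeptide SEQ ID NO n + 1.
    """
    if nucleic_seq_id not in NUCLEIC_SEQ_IDS:
        raise ValueError(f"{nucleic_seq_id} is not a nucleic acid SEQ ID NO")
    return nucleic_seq_id + 1
```

For example, `polypeptide_seq_id(15)` gives 16, matching the second synaptotagmin-like entry of Table A.
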



NOVX nucleic acids and their encoded polypeptides are useful in a variety of
applications and contexts. The various NOVX nucleic acids and polypeptides according to the
invention are useful as novel members of the protein families according to the presence of
domains and sequence relatedness to previously described proteins. Additionally, NOVX
nucleic acids and polypeptides can also be used to identify proteins that are members of the
family to which the NOVX polypeptides belong.

NOV1 is homologous to a Calpain-like family of proteins. Thus, the NOV1 nucleic
acids, polypeptides, antibodies and related compounds according to the invention will be
useful in therapeutic and diagnostic applications implicated in, for example: Von Hippel-
Lindau (VHL) syndrome, obesity, diabetes, autoimmune disease, systemic lupus
erythematosus, Lesch-Nyhan syndrome, developmental defects, Alzheimer's disease,
muscular dystrophy, acoustic trauma, cancer, learning and memory defects, infertility and/or
other pathologies/disorders.

NOV2 is homologous to an Epsin-like family of proteins. Thus NOV2 nucleic acids,
polypeptides, antibodies and related compounds according to the invention will be useful in
therapeutic and diagnostic applications implicated in, for example: cardiomyopathies,
atherosclerosis, hypertension, congenital heart defects, obesity, infertility, cancer, autoimmune
diseases, allergies, developmental defects, dementia, Von Hippel-Lindau (VHL) syndrome,
Alzheimer's disease, stroke, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy,
Lesch-Nyhan syndrome, multiple sclerosis, leukodystrophies, neurodegeneration and/or other
pathologies/disorders.

NOV3 is homologous to a family of Low Density Lipoprotein B-like proteins. Thus,
the NOV3 nucleic acids and polypeptides, antibodies and related compounds according to the
invention will be useful in therapeutic and diagnostic applications implicated in, for example:
Familial hypercholesterolemia, coronary artery disease, diabetes, atherosclerosis, Hepatitis C
infection, Thyroid carcinoma, Von Hippel-Lindau (VHL) syndrome, Cirrhosis,
Transplantation, Psoriasis, Actinic keratosis, Tuberous sclerosis, Acne, Hair growth,
alopecia, pigmentation disorders, endocrine disorders and/or other pathologies/disorders.

NOV4 is homologous to the Purinoceptor-like family of proteins. Thus, NOV4 nucleic 
acids, polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various diseases and 
disorders. 

NOV5 is homologous to the CG8841-like protein family. Thus NOV5 nucleic acids, 
polypeptides, antibodies and related compounds according to the invention will be useful in 
therapeutic and diagnostic applications implicated in, for example: cancer, trauma, 
immunological disease, respiratory disease, gastro-intestinal diseases, reproductive health, 
neurological and neurodegenerative diseases, bone marrow transplantation, metabolic and 
endocrine diseases, allergy and inflammation, nephrological disorders, hematopoietic 
disorders, urinary system disorders and/or other pathologies/disorders. 

NOV6 is homologous to the Synaptotagmin-like family of proteins. Thus NOV6 
nucleic acids, polypeptides, antibodies and related compounds according to the invention will 
be useful in therapeutic and diagnostic applications implicated in, for example: [...] Von 
Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, [...] 
cerebral palsy, [...] Lesch-Nyhan syndrome, [...] ataxia-telangiectasia, leukodystrophies, 
behavioral disorders, [...] Lambert-Eaton myasthenic syndrome and/or other 
pathologies/disorders. 

NOV7 is homologous to the Serine Protease TLSP-like family of proteins. Thus, the 
NOV7 nucleic acids, polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in, for example: 
[...] protease/protease inhibitor deficiency disorders and/or other pathologies/disorders. 

NOV8 is homologous to the Glypican-2 Precursor-like family of proteins. Thus 
NOV8 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in, for example: 
diabetes mellitus non-insulin dependent, autoimmune disease, systemic lupus erythematosus, 
[...] multiple sclerosis, ataxia-telangiectasia, leukodystrophies, neurodegeneration, cancer, 
cardiomyopathy, various cataract disorders, Waardenburg syndrome type I and type II, 
Bjornstad syndrome, Simpson-Golabi-Behmel syndrome, type 1 and type 2, 
Beckwith-Wiedemann syndrome and/or other pathologies/disorders. 

NOV9 is homologous to members of the Mitogen Activated Protein Kinase Kinase 2- 
like family of proteins. Thus, the NOV9 nucleic acids, polypeptides, antibodies and related 
compounds according to the invention will be useful in therapeutic and diagnostic applications 
implicated in, for example: atherosclerosis, metabolic diseases, pathogen infections, 
neurological diseases and/or other pathologies/disorders. 

NOV10 is homologous to members of the Zinc Finger Protein 276 C2H2 type family 
of proteins. Thus, the NOV10 nucleic acids, polypeptides, antibodies and related compounds 
according to the invention will be useful in therapeutic and diagnostic applications implicated 
in, for example: cancer, trauma, immunological disease, respiratory disease, heart disease, 
gastro-intestinal diseases, reproductive health, neurological and neurodegenerative diseases, 
bone marrow transplantation, metabolic and endocrine diseases, allergy and inflammation, 
nephrological disorders, hematopoietic disorders, urinary system disorders and/or other 
pathologies/disorders. 

NOV11 is homologous to members of the Thymosin beta 10-like family of proteins. 
Thus, the NOV11 nucleic acids, polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in, for 
example: prostate cancer, immunological and autoimmune disorders (i.e., hyperthyroidism), 
angiogenesis and wound healing, modulation of apoptosis, neurodegenerative and 
neuropsychiatric disorders, age-related disorders, pathological disorders involving spleen, 
thymus, lung, and peritoneal macrophages and/or other pathologies/disorders. 

The NOVX nucleic acids and polypeptides can also be used to screen for molecules 
which inhibit or enhance NOVX activity or function. Specifically, the nucleic acids and 
polypeptides according to the invention may be used as targets for the identification of small 
molecules that modulate or inhibit, e.g., neurogenesis, cell differentiation, cell proliferation, 
hematopoiesis, wound healing and angiogenesis. 

Additional utilities for the NOVX nucleic acids and polypeptides according to the 
invention are disclosed herein. 

NOV1 

A disclosed NOV1 nucleic acid of 1947 nucleotides (also referred to as 3352274) 
encoding a novel Calpain-like protein is shown in Table 1A. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 1-3 and ending with a TAG 
codon at nucleotides 1945-1947. The start and stop codons are in bold letters in Table 1A. 

Table 1A. NOV1 Nucleotide Sequence (SEQ ID NO:1). 



^CCGAAGTTCATCTGTGAAGACATGAGCCGCACAGAC 
PGCAGCTGCCGCCTCCCTTACTCTGTATCCCCGGCTC 
SCATGGCTACGCAGGCGTCTTCCACTTCCAGCTCTGG 

:aggctgcccgtgcgtgaggggaagctgatgttcgtg 

:CTGGAGAAGGCCTACGCCAAGCTCCACGGCTCCTAT 
'GTGGATTTCACAGGCGGCGTGGGCGAGGTGCTC 
JCGCCATGCCCTGGCCAAGGAGTCCCTCGTGGGC 
iGAGGGCCTGGTAAAGGGACACGCGTATTCCATC 
iCGGCTGCTGCGGCTGCGGAACCCATGGGGCTGC 
TGGGACACACTCCCCACCGAGTGCCGCGATGCCi 

ATCTGCTCGCTGAGCCCGGAGGTGCTGGGCC^^^^ 



5CGTCTTCCACTTCCAGCTCTGG 
3TGAGGGGAAGCTGATGTTCGTG 
^CGCCAAGCTCCACGGCTCCTAT 
;CGGCGTGGGCGAGGTGCTCTAT 
:CAAGGAGTCCCTCGTGGGCGCC 
iGGGACACGCGTATTCCATCACG 

GAGTGGACGGGGGCCTGGAGCGACAGCTG^dcS^GCTGG^^^^ 

ctggtgaaaaaggaggatggcgaStctgS^^ 






The NOV1 nucleic acid sequence maps to chromosome 19 and has 430 of 631 bases 
(68%) identical to a Gallus gallus calcium protease mRNA (gb:GENBANK-ID:[...]|acc:X01415) 
(E = 1.4e-[...]). Similarity information was assessed using public nucleotide databases 
including all GenBank databases and the GeneSeq patent database. Chromosome information 
was assigned using OMIM and the electronic northern tool from Curatools to derive the 
chromosomal mapping of the SeqCalling assemblies, genomic clones, and/or EST sequences 
that were included in the invention. 

In all BLAST alignments herein, the "E-value" or "Expect" value is a numeric 
indication of the probability that the aligned sequences could have achieved their similarity to 
the BLAST query sequence by chance alone, within the database that was searched. For 
example, the probability that the subject ("Sbjct") retrieved from the NOV1 BLAST analysis, 
e.g., Gallus gallus calcium protease mRNA, matched the Query NOV1 sequence purely by 
chance is 1.4e-[...]. The Expect value (E) is a parameter that describes the number of hits one 
can "expect" to see just by chance when searching a database of a particular size. It decreases 
exponentially with the Score (S) that is assigned to a match between two sequences. 
Essentially, the E value describes the random background noise that exists for matches 
between sequences. 
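
By way of illustration only, the exponential dependence of the Expect value on the score can be sketched with the commonly cited relationship E = m * n * 2^(-S'), where S' is the bit score, m is the query length and n is the database length. The function name `expect_value` and the database size used below are assumptions made for this sketch and are not taken from the analyses disclosed herein.

```python
import math


def expect_value(bit_score: float, query_len: int, db_len: int) -> float:
    """Return E = m * n * 2**(-S') for a bit score S'."""
    # math.exp underflows quietly to 0.0 for very large scores, which mirrors
    # how a very strong match can end up listed with an Expect of 0.0.
    return query_len * db_len * math.exp(-bit_score * math.log(2.0))


# Hypothetical search: a 648-residue query against a 100,000,000-residue database.
for score in (30, 40, 50):
    print(score, expect_value(score, 648, 100_000_000))
```

Each additional bit of score halves the Expect value, so a modest gain in score corresponds to a large drop in the number of chance matches expected.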

The Expect value is used as a convenient way to create a significance threshold for 
reporting results. The default value used for blasting is typically set to 0.0001. In BLAST 2.0, 
the Expect value is also used instead of the P value (probability) to report the significance of 
matches. For example, an E value of one assigned to a hit can be interpreted as meaning that 
in a database of the current size one might expect to see one match with a similar score simply 
by chance. An E value of zero means that one would not expect to see any matches with a 
similar score simply by chance. See, e.g., 
http://www.ncbi.nlm.nih.gov/Education/BLASTinfo/. Occasionally, a string of X's or N's 
will result from a BLAST search. This is a result of automatic filtering of the query for 
low-complexity sequence that is performed to prevent artifactual hits. The filter substitutes any 
low-complexity sequence that it finds with the letter "N" in nucleotide sequence (e.g., 
"NNNNNNNNN") or the letter "X" in protein sequences (e.g., "XXX"). Low-complexity 
regions can result in high scores that reflect compositional bias rather than significant 
position-by-position alignment. Wootton and Federhen, Methods Enzymol 266:554-571, 1996. 
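
The masking behavior described above can be illustrated with a deliberately simplified sketch. It is not the filtering algorithm used by BLAST; it is a stand-in that replaces any window whose Shannon entropy falls below a threshold with the mask letter. The function names, the window size of 12 and the threshold of 1.0 bit are assumptions chosen for the illustration.

```python
import math
from collections import Counter


def shannon_entropy(window: str) -> float:
    """Shannon entropy (bits per position) of the letters in a window."""
    total = len(window)
    return -sum((c / total) * math.log2(c / total) for c in Counter(window).values())


def mask_low_complexity(seq: str, window: int = 12, threshold: float = 1.0,
                        mask_char: str = "N") -> str:
    """Replace every window whose entropy is below the threshold with mask_char."""
    masked = list(seq)
    for start in range(len(seq) - window + 1):
        if shannon_entropy(seq[start:start + window]) < threshold:
            for i in range(start, start + window):
                masked[i] = mask_char
    return "".join(masked)


# A homopolymer run is masked; pass mask_char="X" for protein sequences.
print(mask_low_complexity("ACGTTGCA" + "A" * 20 + "GCTAGCAT"))
```

A run of a single letter has an entropy of zero and is always masked, whereas a window in which all four nucleotides are equally represented has an entropy of two bits and is left untouched.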

The disclosed NOV1 polypeptide (SEQ ID NO:2) encoded by SEQ ID NO:1 has 648 
amino acid residues and is presented in Table 1B using the one-letter amino acid code. SignalP, 
Psort and/or Hydropathy results predict that NOV1 does not contain a signal peptide and is 
likely to be localized in the cytoplasm with a certainty of 0.7480. 
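
As a consistency check, the stated polypeptide length follows directly from the open reading frame coordinates: the reading frame spanning nucleotides 1 to 1947 contains 649 codons, the last of which is the stop codon. The helper below is a minimal sketch; the function name is hypothetical and the coordinates are taken as 1-based and inclusive.

```python
def protein_length_from_orf(first_nt_of_start: int, last_nt_of_stop: int) -> int:
    """Number of encoded residues for an ORF given 1-based, inclusive coordinates."""
    orf_length = last_nt_of_stop - first_nt_of_start + 1
    if orf_length <= 3 or orf_length % 3 != 0:
        raise ValueError("ORF length must be a multiple of three and longer than one codon")
    # The final codon is the stop codon and encodes no residue.
    return orf_length // 3 - 1


print(protein_length_from_orf(1, 1947))  # NOV1 coordinates; prints 648
```

The same arithmetic applied to a reading frame running from nucleotide 40 to nucleotide 1773 gives 577 residues.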



Table 1B. Encoded NOV1 protein sequence (SEQ ID NO:2). 

MASSSGRVTIQLVDEEAGVGAGRLQLFRGQSYEAIRAACLDSGILFRDPYFPAGPDALGYDQLGPDSEKAKG 

VKWMRPQEFCAEPKFICEDMSRTDVCQGSLGNCWFIJU^SLTLyPRLLRRVVPPGQDFQHGYAGVFHFQLW 

QFGRWMDVWDDRLPVREGKLMFVRSEQRNEPWAPLLEKAYAKLHGSYEVMRGGHMNEAFVDFTG^^ 

LRQNSMGLFSALRHALAKESLVGATALSDRGEYRTEEGLVKGHAYSITGTHKVSLGFTK\mLLRL^ 

EWTGAWSDSCPRWDTLPTECRDALLVKKEDGEFWMELRDFIiHFDTVQICSLSPEVLGPSPEGGGW 

GRWVRGFNSGGSQPNAETFWTNPQFRLTLLEPDEEDDEDEEGPWGGWGAAGARGPARGGRTPkCTVXLSLIQ 

RNRRRLRAKGLTYLTVGFHVFQVEIDDVISADLQSLQGPYLPIaELGLEQLFQELAGEEEELNASQLQALLSI 

AI^PARAHTSTPREIGLRTCEQLLQCFGGQSIAMHFQQLWGYLLEWQAIFNKFDEDTSGTMNSYELRIAI^ 

AAGFHLNN QLTQTLTSRYRDSRLRVDFERFVSCVAHLTCIFHCSQHLDGGEGVICLTHRQVSQVVmEVATFS 

The NOV1 amino acid sequence has 405 of 456 amino acid residues (88%) identical 
to, and 429 of 456 amino acid residues (94%) similar to, a Mus musculus 720 amino acid 
residue protein (ptnr:TREMBLNEW-ACC:CAC10066) (E = 4.1e-[...]). 

NOV1 is expressed in at least the following tissues: Placenta, whole organism, kidney, 
liver, pancreas, small intestine. This information was derived by determining the tissue sources 
of the sequences that were included in the invention. 

The disclosed NOV1 polypeptide has homology to the amino acid sequences shown in 
the BLASTP data listed in Table 1C. 

Table 1C. BLAST results for NOV1 

Gene Index/Identifier                     Protein/Organism            Length (aa)  Identity (%)   Positives (%)  Expect 
gi|10303329|emb|CAC10066.1| (AJ289241)    calpain 12 [Mus musculus]   720          404/456 (88%)  429/456 (93%)  0.0 
gi|10303331|emb|CAC10068.1| (AJ289241)    calpain 12 [Mus musculus]   462          404/456 (88%)  429/456 (93%)  0.0 
gi|10303330|emb|CAC10067.1| (AJ289241)    calpain 12 [Mus musculus]   502          404/456 (88%)  429/456 (93%)  0.0 
gi|11230800|ref|NP_068694.1|              calpain 12 [Mus musculus]   449          300/342 (87%)  320/342 (92%)  1e-166 
gi|5901916|ref|NP_008989.1|               calpain 11 [Homo sapiens]   702          274/706 (38%)  380/706 (53%)  1e-125 


The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 1D. In the ClustalW alignment of the NOV1 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 1D. ClustalW Analysis of NOV1 

1) Novel NOV1 (SEQ ID NO:2) 

[Alignment figure: ClustalW alignment of NOV1 with gi|10303329|, gi|10303331|, 
gi|10303330|, gi|11230800| and gi|5901916|.] 

The presence of identifiable domains in NOV1, as well as all other NOVX proteins, 
was determined by searches using software algorithms such as PROSITE, DOMAIN, Blocks, 
Pfam, ProDomain, and Prints, and then determining the Interpro number by crossing the 
domain match (or numbers) using the Interpro website (http://www.ebi.ac.uk/interpro). 
DOMAIN results for NOV1, as disclosed in Tables 1E and 1F, were collected from the 
Conserved Domain Database (CDD) with Reverse Position Specific BLAST analyses. This 
BLAST analysis software samples domains found in the Smart and Pfam collections. For 
Tables 1E, 1F and all successive DOMAIN sequence alignments, fully conserved single 
residues are indicated by black shading or by the sign (|) and "strong" semi-conserved residues 
are indicated by grey shading or by the sign (+). The "strong" group of conserved amino acid 
residues may be any one of the following groups of amino acids: STA, NEQK, NHQK, 
NDEQ, QHRK, MILV, MILF, HY, FYW. 

Tables 1E and 1F list the domain description from DOMAIN analysis results against 
NOV1. This indicates that the NOV1 sequence has properties similar to those of other 
proteins known to contain these domains. 
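
The marking convention described above can be expressed compactly. The following is a minimal sketch that assigns "|" to identical residues and "+" to residue pairs falling within one of the listed "strong" groups; the function names are hypothetical, and the sketch only restates the stated convention rather than reproducing the scoring performed by the alignment software itself.

```python
STRONG_GROUPS = ("STA", "NEQK", "NHQK", "NDEQ", "QHRK", "MILV", "MILF", "HY", "FYW")


def column_symbol(a: str, b: str) -> str:
    """Return '|' for identity, '+' for a strong-group pair, otherwise a space."""
    a, b = a.upper(), b.upper()
    if "-" in (a, b):
        return " "  # a gap is never marked as conserved
    if a == b:
        return "|"
    if any(a in group and b in group for group in STRONG_GROUPS):
        return "+"
    return " "


def midline(query: str, subject: str) -> str:
    """Build the marker line for two aligned sequences of equal length."""
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have the same length")
    return "".join(column_symbol(a, b) for a, b in zip(query, subject))


print(repr(midline("FRGQSY", "FENQDY")))
```

For the six aligned residues used in the example, only the identical positions F, Q and Y receive a mark, because none of the remaining pairs shares a strong group.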



Table 1E. Domain Analysis of NOV1 

gnl|Pfam|pfam00648, Peptidase_C2, Calpain family cysteine protease (SEQ ID NO:89) 
Length = [...] residues, 100.0% aligned 
Score = 343 bits (881), Expect = [...] 

[Alignment figure: NOV1 aligned with pfam00648.] 

Table 1F. Domain Analysis of NOV1 



gnl|Smart|smart00230, CysPc, Calpain-like thiol protease family; 
Calpain-like thiol protease family (peptidase family C2). Calcium 
activated neutral protease (large subunit). (SEQ ID NO:90) 
Length = 323 residues, 99.1% aligned 
Score = 342 bits (877), Expect = 4e-95 



NOVl 

00230 

NOVl 

00230 

NOVl 

00230 

NOVl 

00230 

NOVl 

00230 

NOVl 

00230 



27 
1 

87 

56 
147 



FRGQSYEAIRAACLDSGILFRDPYFPAGPDALGYDQLGPDSEKAKGVKWMRPQEFCAEPK 8 6 
I I II +1 11+ I II II 111 I +1 +- ir I I II i +1 

FENQDYEELRQECLEEGGLFVDPLFPAKPS SLFFSQLQRK--^ FWWKRPHE I FE DPP 5 5 

FICEDMSRTDVCQGSLGNCWFLAAAASLTLYPRLLRRWPPGQDFQHGYAGVFHFQLWQF 14 6 

J M N + l II N + n III l + l II I I I l + l l + l II 1+ I++ 

LIVGGASRTDICQGVLGDCWLLATOAALTLREELLARVIPKDQEFSENYAGIYHFRFWRY 115 

GRWMDVVVDDRLPVREGKLMFVRSEQRNEFWAPLIiEKAYAKLHGSYEVMRGGHMNEAFVD 20 6 

l + l + lll + IIIM 1 1 + 1+ I IMII+ Itlllllli I II ++II II I 

116 GKWVDWI DDRLPT YNGDLLE^3HSNSRNEFWSAH.EKAYAKLRGCYEALKGGSTTEALED 175 

207 FTGGVGEVLYLRQNSMG IiFSALRHALAKESLVGArALSDRG-— EYRTEEGLVKGHA 260 

III! I + I++ i II l+l + ll+l + + I + lllllll 

17 6 LTGGVAESIELKKISKDPDELFKDIJKKAFERGSLMGCSIGAGTAVEEEEQKRN^^ 235 

2 61 YSITGTHKVSLGFTKVRLIjmiNPWGCVEWTGAWSDSCPRWDTIiPTECRDAL--LVKKEDG 319 
. ^ N + l +1 + +IIIIIII1I 11 ] III ) i ++ I + II +11 

236 YSVTDVREVDGR--RRQKLLRLRNPWGESEWNGPWSDDSPEWRSVSAEEKKNLGLTMDDDG 294 

320 EFWMELRDFLLHFDTVQICSLSPEVL 345 

IMI IN f I |+|(+| 1+ 
295 EFWMSFEDFLRHFTKVEICNLRPDWF 320 



Cysteine protease activity is dependent on an active dyad of cysteine and histidine, the 
order and spacing of these residues varying in the 20 or so known families. Families C1, C2 
and C10 are loosely termed papain-like, and nearly half of all cysteine proteases are found 
exclusively in viruses. Calpain is an intracellular protease involved in many important cellular 
functions that are regulated by calcium. The protein is a complex of 2 polypeptide chains 
(light and heavy), with three known forms in mammals: a highly calcium-sensitive (i.e., 
micro-molar range) form known as mu-calpain, mu-CANP or calpain I; a form sensitive to 
calcium in the milli-molar range, known as m-calpain, m-CANP or calpain II; and a third 
form, known as p94, which is found in skeletal muscle only. All three forms have identical 
light but different heavy chains. The heavy chain comprises four domains: domain 2 contains 
the catalytic region; domain 4 binds calcium and regulates activity. Domain 2 shows low 
levels of sequence similarity to papain; although the catalytic His has not been located by 
biochemical means, it is likely that calpain and papain are related. Domain 4 has four EF hand 
calcium-binding regions and is similar to sorcin and the Ca2+-binding region of calpain light 
chain. Calpain shows preferential cleavage for Tyr- with leucine or valine as the P2 residue. 

Calpain is unique among the cysteine protease family of enzymes in that it combines 
thiol protease activity with calmodulin-like activity. The enzyme is implicated in a number of 
pathophysiological conditions (Donkor, Curr Med Chem 7(12):1171-1188, 2000). Proteases of 
the caspase and calpain families have been implicated in neurodegenerative processes, as their 
activation can be triggered by calcium influx and oxidative stress (Chan and Mattson, J 
Neurosci Res 58(1):167-90, 1999). Mitochondrial calpain plays an essential role in apoptotic 
commitment by cleaving Bax at its N-terminus and generating the Bax/p18 fragment, which in 
turn mediates cytochrome c release and initiates apoptotic execution (Gao and Dou, J Cell 
Biochem 80(1):53-72, 2001). Deficiency of the nCL-4 calpain protease has been implicated in 
neoplastic transformation (Liu et al., J Biol Chem 275(40):31093-8, 2000). Calpain proteases 
have been implicated in axon and myelin destruction following injury since they degrade 
structural proteins in the axon-myelin unit and may be responsible for destruction of 
myelinated axons adjacent to the lesion site following traumatic injury of the spinal cord 
(Shields et al., J Neurosci Res 61(2):146-50, 2000). Sperm calpain has been shown to be a 
novel component of the biochemical processes that regulate the fertilizing capacity of human 
spermatozoa (Rojas and Moretti-Rojas, Int J Androl 23(3):163-8, 2000). Findings have 
indicated that modulation of calpain activity contributes to muscular dystrophies by disrupting 
normal regulatory mechanisms influenced by calpains (Tidball and Spencer, Int J Biochem 
Cell Biol 32(1):1-5, 2000). 

The above defined information for NOV1 suggests that this calpain-like protein may 
function as a member of the calpain family. Therefore, the NOV1 nucleic acids and proteins of 
the invention are useful in potential therapeutic applications implicated in various diseases and 
disorders described below and/or other pathologies. For example, the NOV1 compositions of 
the present invention will have efficacy for treatment of patients suffering from Von Hippel- 
Lindau (VHL) syndrome, cirrhosis, transplantation disorders, pancreatitis, obesity, diabetes, 
autoimmune disease, renal artery stenosis, interstitial nephritis, glomerulonephritis, polycystic 
kidney disease, systemic lupus erythematosus, renal tubular acidosis, IgA nephropathy, 
hypercalcemia, Lesch-Nyhan syndrome, developmental defects, cataract, spinal cord injury, 
Alzheimer's disease, muscular dystrophy, acoustic trauma, cancer, learning and memory 
defects and infertility. The NOV1 nucleic acid encoding calpain-like protein, and the calpain- 
like protein of the invention, or fragments thereof, may further be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein are to be 
assessed. 

NOV2 

A disclosed NOV2 nucleic acid of 1796 nucleotides (also referred to as 21421174) 
encoding a novel Epsin-like protein is shown in Table 2A. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 40-42 and ending with a 
TAA codon at nucleotides 1771-1773. Putative untranslated regions upstream from the 
initiation codon and downstream from the termination codon are underlined in Table 2A. The 
start and stop codons are in bold letters. 



Table 2A. NOV2 nucleotide sequence (SEQ ID NO:3). 



ATCGGGGCCCTGTGCCC CTTGCT6CTGCAGCCGGGCACCA TGTCGACCTCGTCOTTgncr;nr:rrnr2aT^ 

AACATCGTCCACAACTACTCAGAGGCGGAGATCAAGGTTCGAGAGGCCACGAGCAATGACCCCTGGGGCCCA 
TCCAGCTCCCTCATGTCAGAGATTGCCGACCTCACCTACAACGTTGTCGCCTTCTCGGAGATCATGAGCATG 
ATCTGGAAGCGGCTCAATGACCATGGCAAGAACTGGCGTCACGTTTACAAGGCCATGACGCTGATGGAGTAC 
CTCATCT^GACCGGCTCGGAGCGCGTGTCGCAGCAGTGCAAGGAGAACATGTACGCCGTGCAGACGCTGAAG 
GACTTCCAGTACGTGGACCGCGACGGCAAGGACCAGGGCGTGAACGTGCGTGAGAAAGCTAAGCAGCTGGTG 
GCCCTGCTGCGCGACGAGGACCGGCTGCGGGAAGAGCGGGCGCACGCGCTCAAGACCAAGGAAAAGCTGGCA 
CAGACCGCCACGGCCTCATCAGCAGCTGTGGGCTCAGGCCCCCCTCCCGAGGCGGAGCAGGCGTGGCCGCAG 
AGCAGCGGGGAGGAGGAGCTGCAGCTCCAGCTGGCCCTGGCCATGAGCAAGGAGGAGGCCGACCAGCCCCCG 
TCCTGCGGCCCCGAGGACGACGCCCAGCTCCAGCTGGCCCTTAGTTTGAGCCGAGAAGAGCATGATAAGGAG 
GAGCGGATCCGTCGCGGGGATGACCTGCGGCTGCAGATGGCAATCGAGGAGAGCAAGAGGGAGACTGGGGGC 
AAGGAGGAGTCGTCCCTCATGGACCTTGCTGACGTCTTCACGGCCCCAGCTCCTGCCCCGACCACAGACCCC 
TGGGGGGGCCCAGCACCCATGGCTGCTGCCGTCCCCACGGCTGCCCCCACCTCGGACCCCTGGGGCGGCCCC 
CCTGTCCCTCCAGCTGCTGATCCCTGGGGAGGTCCAGCCCCCACGCCGGCCTCTGGGGACCCCTGGAGGCCT 
GCTGCCCCTGCAGGACCCTCAGTTGACCCTTGGGGTGGGACCCCAGCCCCTGCAGCTGGGGAGGGGCCCACG 
CCTGATCCATGGGGAAGTTCCGATGGTGGTGGGGTCCCGGTCAGTGGGCCCTCAGCCTCCGATCCCTGGACA 
CCGGCCCCGGCCTTCTCAGATCCCTGGGGAGGGTCACCTGCCAAGCCCAGCACCAATGGCACAGCAGCCGGG 
GGATTCGACACGGAGCCCGACGAGTTCTCTGACTTTGACCGACTCCGCACGGCACTGCCGCCCCTCTCCCGG 
ATCCTTCCAGGAGAGCTGGAGCTGCTGGCAGGAGAGGTGCCGGCCCGAAGCCCTGGGGCGTTTGACATGAGT 
GGGGTCAGGGGATCTCTGGCTGAGGCTGTGGGCAGCCCCCCACCTGCAGCCACACCAACTCCCACGCCCCCC 
ACCCGGAAGACGCCGGAGTCATTCCTGGGGCCCAATGCAGCCCTCGTCGACCTGGACTCGCTGGTGAGCCGG 
CCGGGCCCCACGCCGCCTGGAGCCAAGGCCTCCAACCCCTTCCTGCCAGCAGGAGGCCCAGCCACTGGCCCT 
TCCGTCACCAACCCCTTCCAGCCCGCGCCTCCCGCGACGCTCACCCTGAACCAGCTCCGTCTCAGTCCTGTG 
CCTCCCGTCCCTGGAGCGCCACCCACGTACATCTCTCCCCTTGGCGGGGGCCCTGGCCTGCCCCCCATGATG 
CCCCCGGGGC CCCCGGCCCCCAACACTAATCCCTTCCTCCTAgykATCCAGGGCGGAAGGGGGCCT^ 



The disclosed NOV2 nucleic acid sequence, localized to chromosome 19, has 1338 of 
1563 bases (85%) identical to a Homo sapiens EH domain-binding mitotic phosphoprotein 
(Epsin) mRNA (gb:GENBANK-ID:AF073727|acc:AF073727) (E = 1.4e-[...]). 

A NOV2 polypeptide (SEQ ID NO:4) encoded by SEQ ID NO:3 has 577 amino acid 
residues and is presented using the one-letter code in Table 2B. SignalP, Psort and/or 
Hydropathy results predict that NOV2 does not contain a signal peptide and is likely to be 
localized to the mitochondrial matrix space with a certainty of 0.4600 and to the cytoplasm 
with a certainty of 0.4500. 



Table 2B. Encoded NOV2 protein sequence (SEQ ID NO:4). 



The NOV2 amino acid sequence has 569 of 577 amino acid residues (98%) identical 
to, and 569 of 577 amino acid residues (98%) similar to, a Homo sapiens 576 amino acid 
residue protein (ptnr:TREMBLNEW-ACC:BAB14041) (CDNA FLJ12392 FIS, clone 
MAMMA1002699, highly similar to Rattus norvegicus EH domain binding protein epsin 
mRNA (GENBANK-ID:CAB61412)) (E = 3.0e-[...]). 

The disclosed NOV2 is expressed in at least the following tissues: Retinoblastoma, 
leiomyomas, mammary gland, bone trabecular cells, ovary, bone marrow, spleen, placenta, 
heart. This information was derived by determining the tissue sources of the sequences that 
were included in the invention. In addition, the sequence is predicted to be expressed in brain 
tissue because of the expression pattern of a closely related Homo sapiens EH domain-binding 
mitotic phosphoprotein (Epsin) mRNA (gb:GENBANK-ID:AF073727|acc:AF073727). 

NOV2 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 2C. 





Table 2C. BLAST results for NOV2 

Gene Index/Identifier                    Protein/Organism                                           Length (aa)  Identity (%)   Positives (%)  Expect 
gi|14758059|ref|XP_034403.1|             EH domain-binding mitotic phosphoprotein [Homo sapiens]    576          569/578 (98%)  569/578 (98%)  0.0 
gi|3249559|gb|AAC33823.1| (AF018261)     EH domain binding protein Epsin [Rattus norvegicus]        575          541/577 (93%)  548/577 (94%)  0.0 
gi|7019369|ref|NP_037465.1|              EH domain-binding mitotic phosphoprotein [Homo sapiens]    551          544/578 (94%)  544/578 (94%)  0.0 
gi|2072301|gb|AAC60123.1| (U95102)       mitotic phosphoprotein 90 [Xenopus laevis]                 609          356/613 (58%)  402/613 (65%)  1e-126 
gi|3894395|gb|AAC78608.1| (AF062084)     epsin 2a [Homo sapiens]                                    584          292/611 (47%)  348/611 (56%)  1e-102 


The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 2D. 

Table 2D. ClustalW Analysis of NOV2 

1) NOV2 (SEQ ID NO:4) 
2) gi|14758059| EH domain-binding mitotic phosphoprotein [Homo sapiens] 
3) gi|3249559| (AF018261) EH domain binding protein Epsin [Rattus norvegicus] (SEQ ID NO:41) 
4) gi|7019369| EH domain-binding mitotic phosphoprotein [Homo sapiens] 
5) gi|2072301| mitotic phosphoprotein 90 [Xenopus laevis] (SEQ ID NO:43) 
6) gi|3894395|gb|AAC78608.1| (AF062084) epsin 2a [Homo sapiens] (SEQ ID NO:44) 

[Alignment figure: ClustalW alignment of NOV2 with gi|14758059|, gi|3249559|, 
gi|7019369|, gi|2072301| and gi|3894395|.] 




Tables 2E and 2F list the domain description from DOMAIN analysis results against 
NOV2. This indicates that the NOV2 sequence has properties similar to those of other 
proteins known to contain these domains. 



Table 2E. Domain Analysis of NOV2 

gnl|Pfam|pfam01417, ENTH, ENTH domain. The ENTH (Epsin N-terminal 
homology) domain is found in proteins involved in endocytosis and 
cytoskeletal machinery. The function of the ENTH domain is unknown. 
(SEQ ID NO:91) 
Length = 123 residues, 92.7% aligned 
Score = 173 bits (439), Expect = 2e-44 

[Alignment figure: NOV2 aligned with pfam01417.] 


Table 2F. Domain Analysis of NOV2 

gnl|Smart|smart00273, ENTH, Epsin N-terminal homology (ENTH) domain 
(SEQ ID NO:[...]) 
Length = 127 residues, 89.8% aligned 
Score = 149 bits (377), Expect = 3e-37 

[Alignment figure: NOV2 aligned with smart00273.] 


The amino acid sequence of NOV2 also has high homology to other proteins as shown 
in Table 2G. 

Table 2G. BLASTX results for NOV2 

Sequences producing High-scoring Segment Pairs:                     Reading Frame  High Score  Smallest Sum Prob P(N) 
patp:AAB2423[...] vesicle associated prot 13 - Homo sap, 576 aa...  +1             3006        1.7e-312 



Epsin (Eps15 interactor) is a cytosolic protein involved in clathrin-mediated 
endocytosis via its direct interactions with clathrin, the clathrin adaptor AP-2, and Eps15. The 
NH(2)-terminal portion of epsin contains a phylogenetically conserved module of unknown 
function, known as the ENTH domain (epsin NH(2)-terminal homology domain). Findings 
suggest that epsin 1 may function in a signaling pathway connecting the endocytic machinery 
to the regulation of nuclear function (Hyman et al., J Cell Biol 149(3):537-46, 2000). 

During endocytosis, clathrin and the clathrin adaptor protein AP-2, assisted by a
variety of accessory factors, help to generate an invaginated bud at the cell membrane. One of
these factors is Eps15, a clathrin-coat-associated protein that binds the alpha-adaptin subunit
of AP-2. It has been proposed that epsin may participate, together with Eps15, in the
molecular rearrangement of the clathrin coats that are required for coated-pit invagination and
vesicle fission (Chen et al., Nature 394(6695):793-7, 1998).

It has been shown that both rat epsin and Eps15 are mitotic phosphoproteins and that
their mitotic phosphorylation inhibits binding to the appendage domain of alpha-adaptin. Both
epsin and Eps15, like other cytosolic components of the synaptic vesicle endocytic machinery,
undergo constitutive phosphorylation and depolarization-dependent dephosphorylation in
nerve terminals. Furthermore, their binding to AP-2 in brain extracts is enhanced by
dephosphorylation. Epsin together with Eps15 is proposed to assist the clathrin coat in its
dynamic rearrangements during the invagination/fission reactions. Their mitotic
phosphorylation may be one of the mechanisms by which the invagination of clathrin-coated
pits is blocked in mitosis, and their stimulation-dependent dephosphorylation at synapses may
contribute to the compensatory burst of endocytosis after a secretory stimulus (Chen et al., J
Biol Chem 1999 Feb 5;274(6):3257-60).

The above defined information for NOV2 suggests that the NOV2 protein may
function as a member of a family of novel Epsin-like proteins. Therefore, the NOV2 nucleic
acids and proteins of the invention are useful in potential therapeutic applications implicated in
various diseases and disorders described below and/or other pathologies. For example, the
NOV2 compositions of the present invention will have efficacy for treatment of patients
suffering from cardiomyopathies, atherosclerosis, hypertension, congenital heart defects,
aortic stenosis, atrial septal defect, atrioventricular canal defect, ductus arteriosus, pulmonary
stenosis, subaortic stenosis, ventricular septal defect, valve disease, tuberous sclerosis,
scleroderma, obesity, transplantation disorders, endometriosis, infertility, cancer, hemophilia,
hypercoagulation, idiopathic thrombocytopenic purpura, autoimmune disease, allergies,
immunodeficiencies, graft versus host disease, developmental defects, dementia,
Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, hypercalcemia, Parkinson's
disease, Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple
sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, anxiety, pain
and neurodegeneration. The NOV2 nucleic acid encoding Epsin-like proteins, and the Epsin-like
proteins of the invention, or fragments thereof, may further be useful in diagnostic
applications, wherein the presence or amount of the nucleic acid or the protein are to be
assessed.

NOV3 

A disclosed NOV3 nucleic acid of 2973 nucleotides (also referred to as …) is shown
in Table 3A. An open reading frame was identified beginning with an ATG initiation codon at
nucleotides 1-3 and ending with a TAA codon at nucleotides 2971-2973. The start and stop
codons are in bold letters in Table 3A.
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CGACTCCCTGCACAGCCGTCCTGGTATGTACAGTCCTTCCTGTTTAGTTTATGCCAGGAAATTAATCGGGTT 

GGAGGCCATGCCTTGCCAAAGGTGACATTACAGGAGATGCTGAAAAGCTGTATGGTTCAAGTAGTAGCTGCC 

TATGAGAAACTCTCCGAAGAAAAACAGATTAAGAAAGAAGGTGCATTTCCAGTCACCCAGAACCGGGCGCTG 

CAGCTGCTTTATGATCTGCGTTACCTCAACATTGTTCTGACAGCCAAGGGTGACGAGGTGAAGAGTGGCCGG 

AGCAAGCCAGACTCCAGGATTGAGAAAGTGACTGACCACCTGGAAGCCCTCATTGATCCATTTGACCTGGAC

GTTTTCACGCCACACCTCAACAGCAACCTTCATCGCCTGGTGCAGCGAACTTCTGTGCTGTTTGGATTGGTG 

ACTGGTACAGAGAATCAGCTCGCCCCCCGGAGCAGTACGTTCAACTCCCAAGAACCCCATAACATCCTGCCA 

CTGGCATCCAGTCAGATCAGGAGGTTTGGACTTCTCCCACTGAGCATGACAAGCACTCGAAAGGCTAAATCA

ACCAGAAACATCGAAACAAAAGCTCAGGTTGGTCCCCCGGCACGCTCCACAGCTGGTGACCCGACAGTTCCT 

GGCTCCTTGTTCAGACAGCTTGTCAGTGAAGAAGACAACACGTCTGCACCTTCATTATTCAAACTTGGCTGG
CTCTCTAGTATGACTAAGTAA



The disclosed NOV3 nucleic acid sequence maps to chromosome 19p13.1-13.3 and
has 2360 of 2957 bases (79%) identical to a Mus musculus ldlBp (LDLB) mRNA
(gb:GENBANK-ID:AF109377|acc:AF109377) (E = 0.0).

A disclosed NOV3 protein (SEQ ID NO:6) encoded by SEQ ID NO:5 has 990 amino
acid residues, and is presented using the one-letter code in Table 3B. Signal P, Psort and/or
Hydropathy results predict that NOV3 does not contain a signal peptide, and is likely to be
localized to the nucleus with a certainty of 0.7600 and to the mitochondrial matrix space with
a certainty of 0.4824.



Table 3B. Encoded NOV3 protein sequence (SEQ ID NO:6). 



MATAATSPALKRLDLRDPAALFETHGAEEIRGLERQVRAEIEHKKEELRQMVGERYRDLIEAADTIGQMRRC 
AVGLVDAVKATDQYCARLRQAGSAAPRPPRAQQVSPRAPTLRPAGGSRSPWPCRSTPPPSVSPRPRESSPSL 
ARSLSAPHPLGLYLLCCHLHSLLQLDSSSSRYSPVLSRFPILIRQVAAASHFRSTILHESKMLLKCQGVSDQ 
AVAEALCSI^ttLEESSPRQALTDFLLARKATIQKLLNQPHHGAGIKAQICSLVELLATTLKQAHftLFyTLPE 
GLLPDPALPCGLLFSTLETITGQHLPKGTGVLQEEMKLCSWFKHLPASIVEFQPTLRTLAHPISQEYLKDTL 
QKWlHMCNEDIKNGITNLIJ«nrrVKSMKGIAGIRDAmEIJLTNESTNHSWDVI^RIU,U^ 

DRIQTLTKEGFDSISSSSKELLVSALQELESSTSNSPSNKHIHFEYNMSLFLWSESPNDLPSDAAWVSVANR 
^Si'^^*°^^^°^^^^'^°"^^^^°^*^^'^^°°L^YI'PSDDSSLPKDVSPTQAKSSAPDRYADAGTVQ 
E^aRTQSVACIKHXVDCIRAELQSIEE6VQGQQDALNSAKI«SVLEMARLCLSMEI.CPHI*QCIMKSESS 

vP^^«f^iS5f^^'^^^'^®^^^^^Q^S''^QSFLFSLCQEINRVGGHAlPKVTLQ^ 

ZI3^»J^f^^^°^^^^'^^^^*^^=NQIAPRSSTFNSQEPHNILPIJ^SQIRRFGLLPLSMTSTSSs 
TRNIETKAOVGPPARSTAGDPTVPGSLFRQLVSEEDNTSAPSLFKLGWI.SSMTK ^^^-^M^STRKAKS 



The NOV3 amino acid sequence has 807 of 990 amino acid residues (81%) identical
to, and 877 of 990 amino acid residues (88%) similar to, a Mus musculus 980 amino acid
residue protein (ptnr:SPTREMBL-ACC:Q9Z160) (E = 0.0). The global sequence homology is
62.396% amino acid homology and 54.576% amino acid identity.
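
The identity and similarity percentages quoted for NOV3 follow directly from the reported match counts. As a minimal sketch, and not part of the disclosed methods, the Python snippet below recomputes them. The function name `percent` and the choice to truncate to a whole number are assumptions made for illustration; truncation is used only because it reproduces the three figures quoted in the text, not because it is known to be the rule applied by the alignment program.

```python
def percent(matches: int, aligned: int) -> int:
    """Whole-number percentage of matching positions, truncated toward zero."""
    if aligned <= 0:
        raise ValueError("aligned length must be positive")
    return (100 * matches) // aligned


# Match counts as quoted in the text for NOV3.
comparisons = {
    "nucleic acid identity to Mus musculus ldlBp mRNA": (2360, 2957),
    "protein identity to Q9Z160": (807, 990),
    "protein similarity to Q9Z160": (877, 990),
}

for label, (matches, aligned) in comparisons.items():
    print(f"{label}: {matches}/{aligned} = {percent(matches, aligned)}%")
```

The same helper could be applied to the Identity and Positives columns of the BLAST tables, with the caveat that the rounding convention there is not stated in the source.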

NOV3 is expressed in at least the following tissues based on literature sources: ovaries,
liver, epidermis, fibroblast, blood leukocytes.

NOV3 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 3C. 



Table 3C. BLAST results for NOV3

Gene Index/Identifier          Protein/Organism                                     Length (aa)  Identity (%)   Positives (%)  Expect
gi|14776518|ref|XP_040307.1|   hypothetical protein DKFZp762L1710 [Homo sapiens]                 844/902 (93%)  853/902 (93%)  0.0
gi|15011849|ref|NP_038609.2|   low density lipoprotein B [Mus musculus]             980          799/994 (80%)  871/994 (87%)  0.0
gi|14776514|ref|XP_040308.1|   hypothetical protein DKFZp762L1710 [Homo sapiens]    666          660/667 (98%)  661/667 (98%)  0.0
gi|7243143|dbj|BAA92619.1|     (AB03780…) KIAA1381 protein [Homo sapiens]           961          892/951 (93%)  901/951 (93%)  0.0
gi|11360291|pir||T50629        hypothetical protein DKFZp762L1710.1 [Homo sapiens]  438          436/439 (99%)  436/439 (99%)  0.0









The homology of these sequences is shown graphically in the ClustalW analysis shown
in Table 3D.



Table 3D. ClustalW Analysis of NOV3

1) NOV3 (SEQ ID NO:6)
2) gi|14776518|ref|XP_040307.1| hypothetical protein DKFZp762L1710 [Homo sapiens] (SEQ ID NO: …)
6) gi|11360291|pir||T50629 hypothetical protein DKFZp762L1710.1 [Homo sapiens] (SEQ ID NO:49)



[ClustalW multiple sequence alignment of NOV3 with gi|14776518, gi|15011849, gi|14776514, gi|7243143 and gi|11360291. The alignment graphic is not reproduced.]

The amino acid sequence of NOV3 has high homology to other proteins as shown in
Table 3E.



Table 3E. BLASTX results for NOV3

                                                                                  Smallest
                                                                                    Sum
                                                                  Reading  High     Prob
Sequences producing High-scoring Segment Pairs:                    Frame   Score    P(N)     N
patp:AAB75567 Gene 16 hum secr prot homol aa - Mus mus, 174 aa...   +3      789     1.5e-77  1



Low density lipoprotein (LDL) particles are the major cholesterol carriers in
circulation and their physiological function is to carry cholesterol to the cells. In the process of
atiierogenesis these particles are modified and they acc^^^ 

serum cholesterol bound to low density lipoprotein (LDL) is a characteristic of familial
hypercholesterolemia.

By studying cultured fibroblasts from homozygotes, Brown and Goldstein (Proc. Nat.
Acad. Sci. 70: 2804-2808, 1973; Proc. Nat. Acad. Sci. 71: 788-792, 1974) showed that the basic
defect in patients suffering from coronary artery disease and/or familial hypercholesterolemia
concerns the cell membrane receptor for LDL. Normally, LDL is bound at the cell membrane
and taken into the cell, ending up in lysosomes where the protein is degraded and the
cholesterol is made available for repression of the microsomal enzyme
3-hydroxy-3-methylglutaryl coenzyme A (HMG CoA) reductase, the rate-limiting step in
cholesterol synthesis. In the disease state, an internalization mutant of the LDL receptor binds
LDL but is unable to facilitate passage of LDL to the inside of the cell (Goldstein et al.,
Cell 12:629-641, 1977). Along with the disease states discussed above, LDL has been implicated
in viral infection. Studies indicate that Hepatitis C virus (HCV), the principal viral cause of chronic
hepatitis, and other viruses enter cells through the mediation of LDL receptors. The studies
demonstrate that endocytosis of these viruses correlates with LDL receptor activity (Agnello et
al., Proc. Nat. Acad. Sci. 96:12766-71, 1999).

The above defined information for NOV3 suggests that this NOV3 protein may
function as a member of a Low Density Lipoprotein B protein family. Therefore, the NOV3
nucleic acids and proteins of the invention are useful in potential therapeutic and diagnostic
applications. For example, a cDNA encoding the NOV3 protein may be useful in gene
therapy, and the NOV3 protein may be useful when administered to a subject in need thereof.
By way of nonlimiting example, the compositions of the present invention will have efficacy
for treatment of patients suffering from Familial hypercholesterolemia, hyperlipoproteinemia
II phenotype, tendinous xanthomas, corneal arcus, coronary artery disease, planar xanthomas,
webbed digits, hypercholesterolemia, fertility, coronary artery disease, diabetes,
atherosclerosis, xanthomatosis, Hepatitis C infection, regulation, synthesis, transport,
recycling, or turnover of LDL receptors, Cerebral arteriopathy with subcortical infarcts and
leukoencephalopathy, Epiphyseal dysplasia, multiple 1, Ichthyosis, nonlamellar and
nonerythrodermic, congenital, Leukemia, T-cell acute lymphoblastoid, Pseudoachondroplasia,
SCID, autosomal recessive, T-negative/B-positive type, C3 deficiency, Diabetes mellitus,
insulin-resistant, with acanthosis nigricans, Glutaricaciduria, type I, Hypothyroidism,
congenital, Leprechaunism, Liposarcoma, Mucolipidosis IV, Persistent Mullerian duct
syndrome, type I, Rabson-Mendenhall syndrome, Thyroid carcinoma, nonmedullary, with cell
oxyphilia, Erythrocytosis, familial, Malaria, cerebral, susceptibility to, Bleeding disorder due
to defective thromboxane A2 receptor, Cerebellar ataxia, Cayman type, Convulsions, familial
febrile, 2, Cyclic hematopoiesis, Fucosyltransferase-6 deficiency, GAMT deficiency, Von
Hippel-Lindau (VHL) syndrome, Cirrhosis, Transplantation, Psoriasis, Actinic keratosis,
Tuberous sclerosis, Acne, Hair growth, alopecia, pigmentation disorders and endocrine
disorders. The NOV3 nucleic acid encoding Low Density Lipoprotein B-like protein, and the
Low Density Lipoprotein B-like protein of the invention, or fragments thereof, may further be
useful in diagnostic applications, wherein the presence or amount of the nucleic acid or the
protein are to be assessed.

NOV4 

A disclosed NOV4 nucleic acid of 1851 nucleotides (designated CuraGen Acc. No.
AC026756_da1) encoding a novel Purinoceptor-like protein is shown in Table 4A. An open
reading frame was identified beginning with an ATG initiation codon at nucleotides 347-349
and ending with a TGA codon at nucleotides 1358-1360. Putative untranslated regions
upstream from the initiation codon and downstream from the termination codon are underlined
in Table 4A, and the start and stop codons are in bold letters.
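
The open reading frame coordinates given for NOV4 can be checked against the stated protein length. The Python sketch below is illustrative only: the function name `encoded_residues` is hypothetical, and it assumes 1-based inclusive nucleotide coordinates, an uninterrupted reading frame, and a single terminal stop codon.

```python
def encoded_residues(atg_first_base: int, stop_last_base: int) -> int:
    """Number of amino acids encoded by an open reading frame.

    Coordinates are 1-based and inclusive: the first base of the ATG
    initiation codon and the last base of the termination codon.
    """
    span = stop_last_base - atg_first_base + 1
    if span < 6 or span % 3 != 0:
        raise ValueError("ORF span must be a whole number of codons")
    # The termination codon is counted in the span but encodes no residue.
    return span // 3 - 1


# NOV4: ATG at nucleotides 347-349, TGA at nucleotides 1358-1360.
print(encoded_residues(347, 1360))  # 337
```

Under these assumptions the result agrees with the 337 amino acid residues reported for the NOV4 polypeptide.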
Table 4A. NOV4 Nucleotide Sequence (SEQ ID NO:7)



ATCTTGGAATCCCAATCCGTGAGGTTC CTGGGTGTGCTGCrATCAGGACAGCGGTC nA^^ 
TACCCAAATATTGACATCCTGC AGC TAGCCTCAAACAATCAnAGCTAcSTCcSSnAn^ 



GGCTAAAATTGCTAATCCTGATGAAAATCAACAAAATAGAm 



TGATGAAAACATCCCACTCAAGATGCaCTACCTCCCTGTTATTTATGGCATTATCTTCCT^ 

$^S^^t^^°''''*'^^''^'^^^'='=^^^^^<^TTTTCAAAATGAGAC^^ 

^^^^^?^^CT<3GCCTGCACAGATCTGCTGTATCTGACaVGCCTCCCCTTCCTOAT?c^^^ 

^«^mS^^r£^^^^^°^'^'^^*^^^°^^^T^SCCTGTGCTGTGGTGTGGATCATrTCACTGGTAGCTGT 
$^™^^^*^^^''^^^'^^^^'=^^=^='=^CAGGACCAACAGATCAGCCTGTC^ 

GACTGCAAACTGACAGCTGCCT 



TGATAGTGACACTTTGCTATACCACGATTATCCACACTCTGACCCATGi 



^^SS^^^*^^A=<5^GSCTAACCATTCTGCTACTCCTTGCATTTTACGTATGTTTTTkcCC^^ 

atcttgagggtcattcggatcgaatctcgcctgctttcaatcagttgttcSttSStSS^ 



ACTTTACCTAGCATCCTAAGATGTTC AGG ATGTCTCrrTCAATGGAACTCCTG^^ 



AAGTAATCATGTV^ r^6CCAGGGCAr,A«C TTCTAGTTr::r^ 
GGGGAGATATAAGAAT6GGATGCATr,TAn.,,TCAGCAAAr,TAir:rnn>^^ 

CTCAGAGGCAT CTTAGAGAAC ATCTGTTCCCACCAACTTACTAT ^^^^'"■^^^^^"^g^^^TTn GAA 



CTTGCCCTAGATTGCTCAGTAAATT^ 



\TATATACACGGAAACCAATTTCTTACC 



The nucleic acid sequence of NOV4 has 419 of 717 bases (58%) identical to a Mus
musculus P2Y purinoceptor mRNA …

A NOV4 polypeptide (SEQ ID NO:8) encoded by SEQ ID NO:7 is 337 amino acid
residues and is presented using the one letter code in Table 4B. Signal P, Psort and/or
Hydropathy results predict that NOV4 does not contain a signal peptide and is likely to be
localized at the plasma membrane with a certainty of 0.6000.



Table 4B. NOV4 protein sequence (SEQ ID NO:8)




The NOV4 amino acid sequence has 112 of 306 amino acid residues (36%) identical
to, and 179 of 306 residues (58%) positive with, a Mus musculus 373 amino acid residue P2Y1
Purinoceptor protein (ATP Receptor) (ptnr:SWISSPROT-ACC:P49650) (E = 9.4e-56).

NOV4 is expressed in at least the following tissues corresponding to the 20 original
pooled cDNAs it was amplified from: adrenal gland, bone marrow, brain - amygdala, brain -
cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, brain - whole,
fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, mammary
gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, small
intestine, spinal cord, spleen, stomach, testis, thyroid, trachea and uterus.

Possible small nucleotide polymorphisms (SNPs) found for GPCR4 are listed in Table
4C. Depth represents the number of clones covering the region of the SNP. The putative allele
frequency (PAF) is the fraction of these clones containing the SNP. A dash, when shown,
means that a base is not present. The sign ">" means "is changed to."



Table 4C: SNPs

Consensus Position   Depth   Base Change   PAF
193                  39      T>-           0.051
527                  33      C>T           0.061
591                  32      C>T           0.062
614                  38      C>T           0.316
721                  33      T>-           0.061
823                  33      A>G           0.061
929                  33      G>A           0.061
1073                 33      A>-           0.061
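
Because the putative allele frequency is defined as the fraction of covering clones that contain the SNP, the Depth and PAF columns can be related by simple arithmetic. The following Python sketch is a hedged illustration: the function names are hypothetical, and the variant clone counts it derives (for example 2 of 39) are inferred from the table values rather than stated in the source.

```python
def putative_allele_frequency(variant_clones: int, depth: int) -> float:
    """Fraction of the clones covering a position that carry the SNP."""
    if depth <= 0 or not 0 <= variant_clones <= depth:
        raise ValueError("need 0 <= variant_clones <= depth and depth > 0")
    return variant_clones / depth


def inferred_variant_clones(depth: int, paf: float) -> int:
    """Nearest whole number of clones consistent with a reported PAF."""
    return round(depth * paf)


# (consensus position, depth, reported PAF) taken from Table 4C.
rows = [(193, 39, 0.051), (614, 38, 0.316), (823, 33, 0.061)]

for position, depth, paf in rows:
    clones = inferred_variant_clones(depth, paf)
    recomputed = round(putative_allele_frequency(clones, depth), 3)
    print(position, clones, recomputed)
```

For these three rows the recomputed value, rounded to three decimals, equals the PAF listed in the table.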



NOV4 also has homology to the amino acid sequences shown in the BLASTP data
listed in Table 4D.



Table 4D. BLAST results for NOV4

Gene Index/Identifier             Protein/Organism                                                                                             Length (aa)  Identity (%)   Positives (%)  Expect
gi|6679193|ref|NP_032798.1|       purinergic receptor P2Y, G-protein coupled 1; P2Y1 receptor [Mus musculus]                                   373          109/299 (36%)  176/299 (58%)  1e-51
gi|4505557|ref|NP_002554.1|       purinergic receptor P2Y, G-protein coupled, 1 [Homo sapiens]                                                 373          108/299 (36%)  176/299 (58%)  5e-51
gi|2829680|sp|P79928|P2Y8_XENLA   P2Y PURINOCEPTOR 8 (P2Y8) [Xenopus laevis]                                                                   537          104/283 (36%)  161/283 (56%)  7e-51
gi|1352693|sp|P49652|P2YR_MELGA   P2Y PURINOCEPTOR 1 (ATP receptor) (P2Y1) (purinergic receptor) (6H1 orphan receptor) [Meleagris gallopavo]   362          106/299 (35%)  174/299 (57%)  7e-51
gi|464327|sp|P34996|P2YR_CHICK    P2Y PURINOCEPTOR 1 (ATP receptor) (P2Y1) (purinergic receptor) [Gallus gallus]                               362          106/299 (35%)  174/299 (57%)  7e-51



The homology of these sequences is shown graphically in the ClustalW analysis shown
in Table 4E.
Table 4E. ClustalW Analysis of NOV4

1) NOV4 (SEQ ID NO:8)
2) gi|6679193|ref|NP_032798.1| purinergic receptor P2Y, G-protein coupled 1; P2Y1 receptor [Mus musculus] (SEQ ID NO: …)
3) gi|4505557|ref|NP_002554.1| purinergic receptor P2Y, G-protein coupled, 1 [Homo sapiens] (SEQ ID NO: …)
4) gi|2829680|sp|P79928|P2Y8_XENLA P2Y purinoceptor 8 (P2Y8) [Xenopus laevis] (SEQ ID NO:52)
5) gi|1352693|sp|P49652|P2YR_MELGA P2Y purinoceptor 1 (ATP receptor) (P2Y1) (purinergic receptor) (6H1 orphan receptor) [Meleagris gallopavo] (SEQ ID NO:53)
6) gi|464327|sp|P34996|P2YR_CHICK P2Y purinoceptor 1 (ATP receptor) (P2Y1) (purinergic receptor) [Gallus gallus] (SEQ ID NO: …)

[ClustalW multiple sequence alignment of NOV4 with gi|6679193, gi|4505557, gi|2829680, gi|1352693 and gi|464327. The alignment graphic is not reproduced; only its final blocks, covering the C-terminal extension of gi|2829680, follow.]



BNSDOCID: <WO__02055702A2J_> 



wo 02/055702 




PCT/USOl/50925 



NOV4          337                                                                337
gi|6679193|   373                                                                373
gi|4505557|   373                                                                373
gi|2829680|   468  LKRGKWQLSSKKGAAQENEKGHMEPSFEGEGTSTWNLLTPKMYGKKDRLAKNVEEVGYGK  527
gi|1352693|   362                                                                362
gi|464327|    362                                                                362

NOV4          337              337
gi|6679193|   373              373
gi|4505557|   373              373
gi|2829680|   528  EKELQNFPKA  537
gi|1352693|   362              362
gi|464327|    362              362



Table 4F lists the domain description from DOMAIN analysis results against NOV4. 
This indicates that the NOV4 sequence has properties similar to those of other proteins known 
to contain these domains. 



Table 4F. Domain Analysis of NOV4 



gnl|Pfam|pfam00001, 7tm_1, 7 transmembrane receptor (rhodopsin
family). (SEQ ID NO:93)

Length = 254 residues, 100.0% aligned

Score = 125 bits (315), Expect = 3e-30



NOV4 


50 


00001 


1 


NOV4 


110 


00001 


61 


NOV4 


170 


00001 


121 


NOV4 


230 


00001 


181 


NOV4' 


284 


00001 


236 



GNAWISTYIFKMRPWKSSTIIMLNIACTDLLYLTSLPFLIHYYASGENWIFGDFMCKF^ 109 

II +M + + +1 +111! Ill + l +11 II 1 +I + III +11 + ' 

GNLLVILVIIJITKKLRTPTOXFLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALC^ 60 

.RFSFHFNLYSSILFLTCFSIFRYCVIIHPMSCFSIHKTRCAW^CAl^IISLVAVIPMT 169 
I I l+lll II MM I+II+ 1 III +II+++I+ +1 

GALFVVNGYASILLLTAISlbRYIAIVHPLRYRRIRTPRRAKVLILLVWVLALLLSLPPL 120 



+ 1 



I + + I 1+ I ) 1) I+++ +1 I I 1+ I I 



++ 



t I 



rQKARRLTILLLLAFYVCFLPFHILRVIRIESRLLSISCSiENQIHEA 283 
;+ 1 I ++ +++++ I+I + II + II+++ II + I 

LCLLSIWRVLPTA 235 



+++ I I +1+ I ++I 



The amino acid sequence of NOV4 has high homology to other proteins as shown in
Table 4G.



Table 4G. BLASTX results for NOV4

                                                                      Smallest
                                                                        Sum
                                                      Reading  High     Prob
Sequences producing High-scoring Segment Pairs:        Frame   Score    P(N)
patp:AAU04584 Human GPCR 3940, Homo sapiens 337 aa...    +2     1764    7.0e-181
patp:AAG80971 Human nGPCR54 #2 - Homo sapiens 337 aa.    +2     1601    ?.3e-163
The above defined information for NOV4 suggests that this NOV4 protein may
function as a member of a purinoceptor-like protein family. Therefore, the NOV4 nucleic
acids and proteins of the invention are useful in potential therapeutic applications implicated in
various diseases and disorders. The NOV4 nucleic acid encoding purinoceptor-like protein,
and the purinoceptor-like protein of the invention, or fragments thereof, may further be useful
in diagnostic applications, wherein the presence or amount of the nucleic acid or the protein
are to be assessed.

NOV5

NOV5 includes two novel CG8841-like proteins disclosed below. The disclosed 
proteins have been named NOV5a and NOV5b. 

NOV5a

A disclosed NOV5a nucleic acid of 3146 nucleotides (also referred to as
sggc_draft_dj895c5_20000811_da1) encoding a novel CG8841-like protein is shown in Table
5A. An open reading frame was identified beginning with an ATG initiation codon at
nucleotides 1-3 and ending with a TGA codon at nucleotides 2293-2295. A putative
untranslated region downstream from the termination codon is underlined in Table 5A, and
the start and stop codons are in bold letters.



Table 5A. NOV5a Nucleotide Sequence (SEQ ID NO:9)



ATGGGGTCGACCGACTCCAAGCTGflACTTCCGGftaGGCGGTG ATCCAGCTCftCCACCAAGACGCAGCgrrTrr 

^^^?^^^^^:^«^^sccttttgggaccagttctgggcagacacagccaSSSg^ 

GGTGCCGGCAGCAGAGATCCGGGCCGTGCGGGAAGAGTCACCCTCCAACTTGGCCACCCTGTGC^^^ 

^^^^^^2S:^°«^°cagggagctgagagtggctgccactcggagJ^§Ig^^ 

rrnnnno^^^''''°^'''^*''=^*=^^^^"^"^™'^<5SACCGCGACTGGAGGGGC?T^^^ 

^^^^*^'^^^'=<5^tggagctgctgaaactgctgctgacatgcttctccgaggcStgtacc^^ 
^^^^?^^^^'=^'=°«^cccctggtggaggaggctgcccaggtgctcattgtcactttggSSS 

^^S^^^f^^^S^CC^CTGTGGACGGCACCACCACTGGCACCGCCATGGATGATGCaGA^^ 

^^^!£^^^^^at*<5cccggctgctgtccaaccccctgctccagacctacctgcctSctSS 

CCAGTTCCACCAGGAGCTGCTAGTTCTCTTCTGGAAGCTCTGCGACTTCAACAAGA^^ 



?E!?^E^^'=^TACTCAATCCGCGTGCCCATGGAC^TCCCAGTaT??SS^™ 



^''^'''^•^^^^^'^'^'^'^'SGGCACCAGCGGTTGCAGCCCCTOT 
G-GA^^S^S-?— 

CACCCTGGTGGGGCTGCTGCCCGTGCCCC ACCCCATCCTCATCCGCA^^^ 
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ATGTGGTTCCGCACCTACATGTGGGGCGTCATCTATCTGAGGAATGTGGACCCCCCTGTCTGGTACGACACCG 

ACGTGAAGCTGTTTGAiGATACAGCGGGTGTG AGGATGAAGCCGACGAGGGGCTCAGTCTAGGGGAAGGCAGGG 

CCTTGGTCCCTGAGGCTTCCCCCATCCACCATTCTGA6CTTTAAATTACCACGATCAGGGCCTGGAACAGGCA 

6AGT6GCCCTGAGTGTCATGCCCTAGAGACCCCTGTGGCCAGGACAATGTGAACTGGCTCA6ATCCCCCTCAA 

CCCCTAGGCTGGACTCACAGGAGCCCCATCTCTGG6GCTAT6CCCCCACCAGAGACCACTGCCCCCAACACTC 

GGACTCCCTCTTTAAGAeCTGGCTCAGTGCTGGCCCCTCAGTGCCCACCCACTCCTGTGCTACCCAGCCCCAG 

AGGCAGAAGCGAAAATGGGTCACTGTGCCCTAAGGGGTTTGACCAGGGAACCACGGGCTGTCCCTTGAGGTGC 

CTGGACAGGGTAAGGGGGTGCTTCCAGCCTCCTAACCCAAAGCCAGCTGTTCCA6GCTCCAG6GGAAAAAGGT 

GTGGCCAGGCTGCTCCTCGAGGAGGCTG6GAGCTGGCCGACTGCAAAAGCCAGACTGGGGCACCTCCCGTATC 

CTTGGGGCATGGTGTGGGGTG6TGAGGGTCTCCT6CTATATTCTCCTGGATCCATGGAAATAGCCTGGCTCCC 

TCTTACC CAGTT^TGAGGGGCAGGGAAGGGAACTGGGAGGCAGCCGTTTAGTCCTCCCtGCCCTGCCCACTGC 

CTGGATGGGGCGATGCCACCCCTCATCCTTCACCCAGCTCTGGCCTCTGGGTCCCACCACCCAGCCCCCCGTG 

TCAGAACAATCTTTGCTCTGTACAATCGGCCTCTTTACAATAAAACCTCCTGCTCCAftAft AAaAAAAAAAAAA 
AAAAAAA ~" : 



The NOV5a nucleic acid was identified on chromosome 17 and has 567 of 571 bases
(99%) identical to a Homo sapiens DKFZp434I1120 mRNA
(gb:GENBANK-ID:HSM802295|acc:AL137556) (E = 1.1e-2…).

A disclosed NOV5a polypeptide (SEQ ID NO:10) encoded by SEQ ID NO:9 is 764
amino acid residues and is presented using the one-letter code in Table 5B. Signal P, Psort
and/or Hydropathy results predict that NOV5a contains a signal peptide and is likely to be
localized in the plasma membrane with a certainty of 0.7300 and the microbody (peroxisome)
with a certainty of 0.6075. The most likely cleavage site for a NOV5a peptide is between
amino acids 49 and 50, at: FAL-VP.
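
The stated cleavage position can be cross-checked against the N-terminal residues of the NOV5a sequence. The Python sketch below is for illustration only: the helper name `cleavage_context` and its parameters are hypothetical, and the sequence string is the first line of Table 5B as printed. It performs a positional lookup and does not predict signal peptides.

```python
# First 72 residues of NOV5a, copied from the first line of Table 5B.
NOV5A_N_TERMINUS = (
    "MGSTDSKLNFRKAVIQLTTKTQPVEATDDAFWDQFWADTATSVQDVFALVPAAEIRAVREESPSNLATLCYK"
)


def cleavage_context(sequence: str, last_signal_residue: int,
                     left: int = 3, right: int = 2) -> str:
    """Residues flanking a cleavage site, written as 'XXX-YY'.

    last_signal_residue is the 1-based position of the final residue
    before the cleavage point (49 for a cut between residues 49 and 50).
    """
    if not left <= last_signal_residue <= len(sequence) - right:
        raise ValueError("cleavage position too close to a sequence end")
    before = sequence[last_signal_residue - left:last_signal_residue]
    after = sequence[last_signal_residue:last_signal_residue + right]
    return f"{before}-{after}"


print(cleavage_context(NOV5A_N_TERMINUS, 49))  # FAL-VP
```

The lookup returns the same FAL-VP motif given in the text for a cut between amino acids 49 and 50.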



Table 5B. Encoded NOV5a protein sequence (SEQ ID NO:10)



MGSTDSKLNFRKAVIQLTTKTQPVEATDDAFWDQFWADTATSVQDVFALVPAAEIRAVREESPSNLATLCYK 
AVEKLVQGAESGCHSEKEKQIVLNCSRLLTRVLPYIFEDPDWRGFFWSTVE-QQGEEDDEHARPLAESLLLAl 
ADLLFCPDFTVQSHRRSTVDSAEDVHSLDSCEYIWEAGVGFAHSPQPNyiHDMNRMELI«LLLTCFSEAMyL 
PPAPESGSTNPWVQFFCSTENRHALPLFTSLLNTVCAYDPVGYGIPYNHLLFSDYREPI.VEEAAQVLIVTLD 
HDSASSASPTVDGTTTGTAMDDADDFQFILKGIARLLSNPLLQTYLPNSTKKIQFHQELLVLFWKLCDFNKK 
FLFFVLKSSDVI,DILVPILFFLNDARADQSRVGLMHIGVFII,LIJ:.SGERNBX3VRI.NKPySIRVPMDIPVFTG 
THADLLIWFHKIITSGHQRLQPLFDCiaiTIVVNVSPYLKSLSMVTANKLLHLLEAFSTTWFLFSAAONHHL 
™^?ir'"^^°^°^'''^*'^^^^*^^^^^™Q"^5TDPPTIHKAI.QRRRRTPEPLSRTGSQEGTSME 
GSRPAAPAEPGTLKTSLVATPGIDKLTEKSQVSEDGTLRSLEPEPQQSLEDGSPAKGEPSQAWREQRRPSTS 
SASGQWSPTPEWVLSWKSKLPLQTIMRLLQVLVPQVEKICIDKGLTDESEILRFLQHGTLVGLLPVPHPILI 
RKYQaNSGTAMWFRTYMWGVIYLRNVDPPVWYDTDVKLFEIQRV • 



The NOV5a amino acid sequence has 397 of 638 amino acid residues (62%) identical 
to, and 478 of 638 amino acid residues (74%) similar to, a Drosophila melanogaster 837 
amino acid residue CG8841 protein (ptnr:SPTREMBL-ACC:Q9V695) (E = 5.9e"^^°). 
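
The identity and similarity percentages quoted for NOV5a can be reproduced from the match counts. The sketch below assumes the percentages are truncated to whole numbers rather than rounded. That is an assumption inferred from the figures (478/638 is about 74.9% but is reported as 74%), not a convention stated in the text, and the function name is hypothetical.

```python
def blast_percent(matches, aligned_length):
    """Percentage of matching positions, truncated to a whole number.

    Truncation (integer division) is an assumption inferred from the
    reported figures, not a documented convention of this text.
    """
    return (100 * matches) // aligned_length


print(blast_percent(567, 571))  # NOV5a nucleotide identity
print(blast_percent(397, 638))  # NOV5a amino acid identity
print(blast_percent(478, 638))  # NOV5a amino acid similarity
```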

NOV5a is expressed in at least the following tissues: adrenal gland/suprarenal gland, 
amygdala, bone marrow, brain, colon, dermis, duodenum, hippocampus, hypothalamus, 
kidney, larynx, liver, lung, lymph node, lymphoid tissue, mammary gland/breast, ovary, 
pancreas, parotid salivary glands, pituitary gland, retina, small intestine, spinal cord, 
stomach, substantia nigra, testis, thalamus, tonsils, umbilical vein, uterus, whole organism. 
This information was derived by determining the tissue sources of the sequences that were 
included in the invention. In addition, NOV5a is predicted to be expressed in testis tissue 
because of the expression pattern of a closely related Homo sapiens DKFZp434I1120 mRNA 
(gb:GENBANK-ID:HSM802295|acc:AL137556). 

NOV5b 

A disclosed NOV5b nucleic acid of 3314 nucleotides (also referred to as CG54443-02) 
encoding a novel CG8841-like protein is shown in Table 5C. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 97-99 and ending with a 
TGA codon at nucleotides 2461-2463. Putative untranslated regions upstream from the 
initiation codon and downstream from the termination codon are underlined in Table 5C, and 
the start and stop codons are in bold letters. 



Table 5C. NOV5b Nucleotide Sequence (SEQ ID NO:11) 

The NOV5b nucleic acid was identified on chromosome 17 and has 1155 of 1162 
bases (99%) identical to a Homo sapiens DKFZp434I1120 mRNA (gb:GENBANK-
ID:HSM802295|acc:AL137556.1) (E= 1.2e""^) 

A disclosed NOV5b polypeptide (SEQ ID NO:12) encoded by SEQ ID NO:11 is 788 
amino acid residues and is presented using the one-letter code in Table 5D. Signal P, Psort 
and/or Hydropathy results predict that NOV5b contains a signal peptide and is likely to be 
localized to the plasma membrane with a certainty of 0.7300 and to the microbody 
(peroxisome) with a certainty of 0.6006. The most likely cleavage site for a NOV5b peptide is 
between amino acids 49 and 50, at: FAL-VP. 

Table 5D. Encoded NOV5b protein sequence (SEQ ID NO:12). 

MGSTDSKLNFRKAVIQLTTKTQPVEATDDAFWDQFWADTATSVQDVFALVPAAEIRAVREESPSNLATLCYKAVEK 

LVQGAESGCHSEKEKQIVLNCSRLLTRVi:.PYIFEDPDWRGFFWSTVPGAGRGGQGEEDDEHARPIAESLLLAIADL 

LFCPDFTVQSHRRSWDSAEDVHSLDSCEYlWEAGVGEMSPQP^ryIHDMNR^ffiIi^a^LTCFSEAMYLPPAPESG 

STNPWVQFFCSTENRHftLPLFTSLLNTVCAYDPVGYGIPYNHLLFSDTGEPLVEEAAQVLIVTLDHDSASSASPTV 

DGTTTGTAMDDADPPGPENLFVNYLSRIHREEDFQFILKGIARLLSNPLLQTYLPNSTKKIQFHQELLVLFWKLCD 

FNKKFLFFVLKSSDVLDILVPILFFLNDARADQSRVGLMHIGVFILLLLSGERNFGVRLNKPYSIRVPMDIPVFTG 

THADLLiyVFHKIITSGHQRLQPLFDCLLTIWNVSPYLKSLSMVTANKLLHLLEAFSTTWFLFSAAQNHHLVFFL " 

LEVFNNIIQYQFDGNSNLVYAIIRKRSIFHQLANLPTDPPTIHKALQRRRRTPEPLSRTGSQEGTSMEGSRPAAPA 

EPGTLKTSLVATPGIDKLTEKSQVSEDGTLRSLEPEPQQSLEDGSPAKGEPSQAWREQRRPSTSSASGQWSPTPEW ' 

VLSWKSKLPLQTIMRLLQVLVPQVEKICIDKGLTDESEILRFLQHGTLVGLLPVPHPILIRKYQANSGTAMWFRTY 
MWGVIYLRMVDPPVWYDTDVKLFEIQRV 

The NOV5b amino acid sequence has 409 of 662 amino acid residues (61%) identical 
to, and 491 of 662 amino acid residues (74%) similar to, a Drosophila melanogaster 837 
amino acid residue CG8841 protein (ptnr:SPTREMBL-ACC:Q9V695) (E = 1.4e"^"). 

NOV5b is expressed in at least the following tissues: Adrenal Gland/Suprarenal gland, 
Bone Marrow, Brain, Cartilage, Colon, Dermis, Duodenum, Gall Bladder, Kidney, 
Liver, Lung, Lymph node, Lymphoid tissue, Mammary gland/Breast, Ovary, Pancreas, Parotid 
Salivary glands, Pituitary Gland, Prostate, Retina, Small Intestine, Spinal Cord, Spleen, 
Stomach, Testis, Tonsils, Urinary Bladder, Uterus, Vein, Vulva. In addition, this gene was 
expressed in the following disease states: prostatic adenocarcinoma, ovarian carcinoma, colon 
carcinoma, uterine carcinoma, pancreatic adenocarcinoma, breast cancer. This information 
was derived by determining the tissue sources of the sequences that were included in the 
invention. 






NOV5a and NOV5b are very closely homologous as is shown in the amino acid 
alignment in Table 5E. 
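
A pairwise alignment such as the one in Table 5E can be summarized as a percent identity. The following is a minimal sketch of that calculation, assuming two pre-aligned strings of equal length with `-` marking gaps. The function name is hypothetical, and the toy sequences are illustrative only and are not taken from NOV5a or NOV5b.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Return (identical columns, compared columns, percent identity)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences carry a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    identical = sum(1 for a, b in columns if a == b and a != gap)
    return identical, len(columns), 100.0 * identical / len(columns)


# Hypothetical toy alignment, not taken from Table 5E.
toy_a = "MKTAY--LLQ"
toy_b = "MKSAYGALLQ"
print(percent_identity(toy_a, toy_b))
```

Columns containing a gap in one sequence count toward the aligned length but not toward the identical total. This is one common convention; other tools may count gap columns differently.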

Table 5E. Amino Acid Alignment of NOV5a and NOV5b 



[Pairwise alignment of the NOV5a (SEQ ID NO:10) and NOV5b (SEQ ID NO:12) amino acid sequences given in Tables 5B and 5D.] 



Homologies to any of the above NOV5 proteins will be shared by the other NOV5 
proteins insofar as they are homologous to each other as shown above. Any reference to 
NOV5 is assumed to refer to both of the NOV5 proteins in general, unless otherwise noted. 

NOV5a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 5F. 





Table 5F. BLAST results for NOV5a 

Gene Index/Identifier            Protein/Organism                     Length (aa)  Identity (%)   Positives (%)  Expect 
gi|7303477|gb|AAF585...          CG8841 gene product                  837          519/844 (61%)  600/844 (70%)  0.0 
(AE003822)                       [Drosophila melanogaster] 
gi|7505130|pir||T16522           hypothetical protein K02E10.2        729          422/782 (53%)  530/782 (66%)  0.0 
                                 [Caenorhabditis elegans] 
gi|11360052|pir||T46395          hypothetical protein                 380          328/354 (92%)  328/354 (92%)  0.0 
                                 DKFZp434I1120.1 (fragment) 
                                 [Homo sapiens] 
gi|7106107|emb|CAB76033...       conserved hypothetical protein       767          203/837 (24%)  360/837 (42%)  1e-44 
(AL157917)                       [Schizosaccharomyces pombe] 
gi|6648087|sp|O13776|YE9G_SCHPO  hypothetical 104.8 kDa protein       925          166/679 (24%)  299/679 (43%)  3e-31 
                                 C17A5.16 in chromosome I 
                                 [Schizosaccharomyces pombe] 

The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 5G. 



Table 5G. ClustalW Sequence Alignment 

1) NOV5a (SEQ ID NO:10) 

NO:57) ^ "^^^^P^^^^-^l 120.1 (fragment) [Homo sapiem](^^^ 

[ClustalW multiple alignment of NOV5a with the sequences gi|7303477, gi|7505130, gi|11360052, gi|7106107 and gi|6648087 listed in Table 5F.] 






WO 02/055702 PCT/US01/50925 

The amino acid sequence of NOV5 has high homology to other proteins as shown in 
Table 5H. 



Table 5H. BLASTX results for NOV5 



Sntallest 
Sum 



sequences prod ucing High-scoring Segment Pai^.. ''^ ''^^'^KttfH scor^ 
patp.AAY91493 Secreted nrot aegu cene 43, Homo Saoi 214 aa.. +1 614 



P(N) N 

l.le-140 1 
6.3e-97 1 



The above defined information for NOV5 suggests that this NOV5 protein may 
function as a member of a CG8841-like protein family. Therefore, the NOV5 nucleic acids 
and proteins of the invention are useful in potential therapeutic applications implicated in 
various diseases and disorders described below and/or other pathologies. For example, the 
NOV5 compositions of the present invention will have efficacy for treatment of patients 
suffering from cancer, trauma, immunological disease, respiratory disease, gastro-intestinal 
diseases, reproductive health, neurological and neurodegenerative diseases, bone marrow 
transplantation, metabolic and endocrine diseases, allergy and inflammation, nephrological 
disorders, hematopoietic disorders or urinary system disorders. The NOV5 nucleic acid 
encoding CG8841-like protein, and the CG8841-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 



NOV6 



NOV6 includes two novel Synaptotagmin-like proteins disclosed below. The disclosed 
proteins have been named NOV6a and NOV6b. 



NOV6a 



A disclosed NOV6a nucleic acid of 1116 nucleotides (also referred to as 
SC134912642_dal) encoding a novel Synaptotagmin-like protein is shown in Table 6A. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 1-3 
and ending with a TGA codon at nucleotides 1114-1116. The start and stop codons are in bold 
letters in Table 6A. 
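
The open reading frame coordinates quoted for the NOV6 nucleic acids imply a protein length that can be checked against the stated polypeptide sizes. The sketch below is illustrative only; the function name is hypothetical and it assumes 1-based, inclusive nucleotide coordinates, with the stop codon counted in the span but not translated.

```python
def orf_protein_length(start_codon_first_nt, stop_codon_last_nt):
    """Number of amino acids encoded by an ORF given 1-based inclusive
    coordinates of the first start-codon base and last stop-codon base."""
    span = stop_codon_last_nt - start_codon_first_nt + 1
    if span % 3 != 0:
        raise ValueError("ORF span is not a whole number of codons")
    return span // 3 - 1  # the stop codon is not translated


print(orf_protein_length(1, 1116))  # NOV6a: ATG at 1-3, TGA at 1114-1116
print(orf_protein_length(1, 1212))  # NOV6b: ATG at 1-3, TGA at 1210-1212
```

The two results, 371 and 403, agree with the polypeptide lengths stated for SEQ ID NO:14 and SEQ ID NO:16.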



Table 6A. NOV6a Nucleotide Sequence (SEQ ID NO:13) 
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TGACCGCTTCAGCCGCCACGACCCCATTGGGGAGGTGTCCATCCCCCTTAAACAGGTGGACCTGACCCAGATG 
CAGATCTG6AAGGATCTGAAGCCATGCAGCGATGGGAGTGGGAGCCGAGGGGAGCTGCTCTTGTCTCTCTGCT 
ACAACCCCTCTGCCAACTCCATCATCGTGAACATCATCAAAGCCCGGAACCTCAAAGCCATGC5ACATCGGGGG 
CACATCAGACCCCTACGTGAAGGTATGGCTGATGTACAAGGACAAGCGGGTGGAGAAGAAGAAGACGGTGACG 
ATGAAGAGGAACCTGAACCCCATCTTCAATGAGTCCTTCGCCTTCGATATCCCCACGGAGAA6CTGAGGGAGA 
CGACCATCATCATO^CTGTCaVTGGACAAGGACAAGCTCAGCCGCAATGACGTCaTCGGCAAGATCTACCTGTC 

CTGGAAGAGCGGGCCAGGGGAGGTGAAGCACTGGAAGGACATGATTGCCCGTCCCCGGCAGCCCGTGGCCCAG 

TGGCACCAGCTGAAGGCCTGA 



The NOV6a nucleic acid was identified on chromosome 11q12.2 and has 709 of 768 
bases (92%) identical to a Mus musculus synaptotagmin VII mRNA (gb:GENBANK-
ID:AB026804|acc:AB026804) (E= 1.3e-^^. 

A disclosed NOV6a polypeptide (SEQ ID NO:14) encoded by SEQ ID NO:13 is 371 
amino acid residues and is presented using the one-letter code in Table 6B. Signal P, Psort 
and/or Hydropathy results predict that NOV6a contains a signal peptide and is likely to be 
localized in the cytoplasm with a certainty of 0.8200. The most likely cleavage site for a 
NOV6a peptide is between amino acids 35 and 36, at: VLA-SR. 



Table 6B. Encoded NOV6a protein sequence (SEQ ID NO:14) 



MyRDPEAASPGAPSRD\^LVSMITVSLSVTWIASRCHWCQRKLGKRYKNSl^TVGTPDSGRGRSEKKAIK 

LPAGGKAVOTAPVPGQTPHOBSDRRTEPRSSFSDLVNSLTSEMOMESTLTVKIMKAQELPAKDFSGTSDPFV 

KIYLLPDKKHKLETKVKRKNLNPHWNETPLFEGFPYEKVVQRILYLQVLDYDRFSRHDPIGEVSIPLKQVDL 

TQMQ1WKDIJCPCSDGS6SRGELLLSI.CYNPSANSIIWIIKARNIJCAMDIGGTSDPYVKVWIMYKDKRVEKK 

KTVTMKRNLNPIFNESFAFDIPTEKLRETTIIITVMDKDKLSRNDVIGKIYLSWKSGPGEVKHWKDMIARPR 
QPVAQWHQIiKA 



The NOV6a amino acid sequence has 248 of 255 amino acid residues (97%) identical 
to, and 251 of 255 amino acid residues (98%) similar to, a Rattus norvegicus 403 amino acid 
residue synaptotagmin VII protein (ptnr:SPTREMBL-ACC:Q62747) (E = 1 3'^^. 

NOV6a is expressed in at least the following tissues: Adrenal Gland/Suprarenal gland, 
Bone, Brain, Cerebral Medulla/Cerebral white matter, Heart, Hippocampus, Liver, Mammary 
gland/Breast, Pituitary Gland, Placenta, Salivary Glands, Thalamus. This information was 
derived by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or 
RACE sources. 



NOV6b 



A disclosed NOV6b nucleic acid of 1212 nucleotides (also referred to as CG56106-01) 
encoding a novel Synaptotagmin-like protein is shown in Table 6C. An open reading frame 
was identified beginning with an ATG initiation codon at nucleotides 1-3 and ending with a 
TGA codon at nucleotides 1210-1212. The start and stop codons are in bold letters in Table 
6C. 



Table 6C. NOV6b Nucleotide Sequence (SEQ ID NO:15). 
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ATGTACCGGGACCCGGflGGCGGCCAGCCCaGGGGCGCeCTCGCGCGACGT.CCTGCTGGTCTCTGCCATCa 

TCACCGTCAGCCTTAGCGTCACTGTaSTCCTCTGCGGCCTCTGCCACTGOTGTCaMSCGC^ 

ACGCTACAAGAATTCCTTGGAGACGGTGGGCACGCCASACTCAGGGCGTGGGCGCAGTGAGflAGAAGGCT 

ATCAAGTTGCCTGCAGGA6GGAAGGCGGTGAACACAGCCCCCGTGCCAGGCCAGACACCCCACGATGAGT 

CCGACCGCCGGACCGAGCCACGTTCCTCCGTCTCAGACCTCGTCAaCTCCCTCACCAGCGAGATGCTCAT 

GCTCTCCCCAGGCTCCGAGGAGGATGAGGCCCACGAGGGTTGCAGCCGAGAGAACCTGGGCCGGATCCAG 

TTCaGTGTCGGCTACAACTTCCAGGAGTCCACGCTCACCGTGAAGATCATGAAGGCCCAGGAGCTGCCGG 

CCAAGGACTTCAGCGGCACCAGCGACCCCTTCGTCAAGATCTACCTGCTGCCCGACAAGAAGCACAAGCT 

GGAGACCflAGGTGAAGCGGAAGAACCTGAACCCCCACTGGAACGAGACCTTCCTCTTTGAAGGTTTTCCC 

TATGAGAAGGTGGTGCAGAGGATCCTCTACCTCCAAGTCCTGGACTATGACCGCTTCAGCCGCAACGACC 

CCATTGGGGAGGTGTCCATCCCCCTTAACAAGGTGGACCTGACCCAGATGCAGACCTTCTGGAAGGATCT 

GAAGCCATGCAGCGATGGGAGTGGGAGCCGAGGGGAGCTGCTCTTGTCTCTCTGCTACAACCCCTCTGCC 

AACTCCATCATCGTGAACATCATCAAAGCCCGGAACCTCAAAGCCATGGACATCGGGGGCACATCAGACC 

CCTACGTGAAGGTATGGCTGATGTACAAGGACAAGCGGGTGGAGAAGAAGAAGACGGTGAC6ATGAAGAG 

GAACCTGAACCCCATCTTCAATGAGTCCTTCGCCTTCGATATCCCCACGGAGAAGCTGAGGGAGACGACC 

ATCATCATCACTGTCATGGACAAGGACAAGCTCAGCCGCAATGACGTCATCGGCAAGATCTACCTGTCCT 

GGAAGAGCGGGCCAGGGGAGGTGAAGCACTG6AAGGACATGATTGCCCGTCCCCGGC3VGCCCGTGGCCCA 
GTGGCACCAGCTGAAGGCCTOA. ^-<---i,itotti,uwi 



The NOV6b nucleic acid was identified on chromosome 11q12-13.1 and has 1201 of 
1212 bases (99%) identical to a Homo sapiens synaptotagmin VII mRNA (gb:GENBANK-
ID:AF038535|acc:AF038535.1) (E = 5.6e"^") 

A disclosed NOV6b polypeptide (SEQ ID NO:16) encoded by SEQ ID NO:15 is 403 
amino acid residues and is presented using the one-letter code in Table 6D. Signal P, Psort 
and/or Hydropathy results predict that NOV6b contains a signal peptide and is likely to be 
localized in the endoplasmic reticulum (membrane) with a certainty of 0.8200 and the plasma 
membrane with a certainty of 0.5140. The most likely cleavage site for a NOV6b peptide is 
between amino acids 46 and 47, at: KLG-KR. 



Table 6D. Encoded NOV6b protein sequence (SEQ ID NO:16). 



^f.S^^^^^^'^^^^^^^^^'^^^^^^'^^^^^^^^^Q^K^^St ^yKNSLETVGTPDSGRGRSEKKAIK^ 
GGKAVNTAPVPGQTPHDESDRRTEPRSSVSDLVNSLTSEMI^LSPGSEEDEAHEGCSRENLGRIOFS^^TOES 

YDRFSRNDPIGEVSIPLNKVDLTQMQTFWKDKCPCSDGSGSRGELLLSLCYNPSANSIIVNIlSJ^ 



The NOV6b amino acid sequence has 398 of 403 amino acid residues (98%) identical 
to, and 401 of 403 amino acid residues (99%) similar to, a Rattus norvegicus amino acid 
residue synaptotagmin VII protein (ptnr:SPTREMBL-ACC:Q62747) (E = 7.1e"^^^). 

NOV6b is expressed in at least the following tissues: Adrenal Gland/Suprarenal gland, 
Bone, Brain, Cerebral Medulla/Cerebral white matter, Heart, Hippocampus, Liver, Mammary 
gland/Breast, Pituitary Gland, Placenta, Salivary Glands, Thalamus. This information was 
derived by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or 
RACE sources. 

NOV6a and NOV6b are very closely homologous as is shown in the amino acid 
alignment in Table 6E. 

Table 6E. Amino Acid Alignment of NOV6a and NOV6b 



10 

.|....|, 



20 
. I . . 



MYRDPEAASPGAPSRDVLLVSAIITVSLSVTVVL? 
MYRDPEAASPGAPSRDVLLVSAIITVSLSVTVV 



30 

I .... I . . 
2CGI 



40 50 



m 



60 

. 1 . . 



70 



80 



90 

. 1 . . 



100 



NSLETVGTPDSGRGRSEKKAIKLPAGGKAVNTAPVPGQTPHDESDRRTEP 
NSLSTVGTPDSGRGRSEKKAlKLPAGGKAVNTAPVPGOTPHDESnRPTF.P 



110 120 130 



140 



RS SgS DLVN S LTS EMLMgS 
RS sBs DL VN S LTS EMLmSs 



150 
..I 



PGSEEDEAHEGCSRENIiGRIQFSVGYNFQES 



160 170 180 190 
■ ■ I 1 I 1 I 1 1 . 



200 



TLTVKIMKAQELPAKDFSGTSDPFVKIYLLPDKKHKLETKVKRKNLNPHW 
TLTVKlMKAQELPAKDFSGTSDPFVKIYLLPDKKHKI.RTTsV/rcRrcfar.MPMhi 



210 



220 230 
. . I I 1 I 



240 



250 



NETFLFEGFPYEKVVQRILYLQVLDYDRFSRgDPIGEVSIPLf^VDLTQri 
^ETFLFEGFPYEKVVQRILYLQVLDYDRFS RgJDPIGEVSr PlBS VDLTQE^ 



260 



270 280 

. 1 I I I 1 . 



290 



300 



^^KDLKPCSDGSGSRGELLLSLCYWPSAWSIIV^TIIKAR1^ILKAMDIGG 
WKDLKPCSDGSGSRGELLLSLC YMPSAHSIIVNITKARMT.KnMnTcr: 



310 



320 
. . I . . . . 1 . 



330 



340 



350 



TSDPYVKVWLMYE-CDKRVEKKKTVTMKRNLWPIFWESFAFDIPTEKLRETT 
TSDPYVKVMLMYKDKRVEKKKTVTMKRNLlvrPIFHESFAFnTPTr.K'T.Pi7TT 



360 370 
I I .... I L 



380 



390 400 
. I I I 



IIITVMDKDKLSRI^DVIGKIYLSWKSGPGEVKHWKDMIARPRQPVAQWHQ 
IIITWlDrCDKLSRt^DVIGKIYLSCiUvSGPGEVKHWKDMIARPRQPVAQl<?HQ; 



NOV6a 
NOV6b 



Homologies to any of the above NOV6 proteins will be shared by the other NOV6 
proteins insofar as they are homologous to each other as shown above. Any reference to 
NOV6 is assumed to refer to both of the NOV6 proteins in general, unless otherwise noted. 

NOV6a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 6F. 





Table 6F. BLAST results for NOV6a 

Gene Index/Identifier               Protein/Organism       Length (aa)  Identity (%)   Positives (%)  Expect 
gi|11067375|ref|NP_067691.1|        synaptotagmin 7        403          358/403 (88%)  363/403 (89%)  0.0 
                                    [Rattus norvegicus] 
gi|9055364|ref|NP_061271.1|         synaptotagmin 7        403          356/403 (88%)  362/403 (89%)  0.0 
                                    [Mus musculus] 
gi|2724126|gb|AAB92667.1|           synaptotagmin VII      418          350/403 (86%)  356/403 (87%)  0.0 
(AF038535)                          [Homo sapiens] 
gi|12667450|gb|AAK01451.1|          synaptotagmin VIIa     520          296/351 (84%)  304/351 (86%)  1e-159 
AF336856_1 (AF336856)               [Rattus norvegicus] 
gi|12667458|gb|AAK01455.1|          synaptotagmin VIIe     643          296/351 (84%)  304/351 (86%)  1e-159 
AF336860_1 (AF336860)               [Rattus norvegicus] 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 6G. 

Table 6G Information for the ClustalW proteins 

l)NOV6a(SEQroNO:14) 

?l 067691.11 synaptotagmin 7 [Rattus norvegicus] (SEQ K) NO-60> 

3) gi 9055364|reflNP 061271.1 1 synaptotagmin 7 [Miis musculus] (SEP ID NQ'6n ' 

^^T NEUROLYSIN PRECURSOR (NEUROTENSIN ENDOPEPTIDASm 
(MITOCHONDRIAL OUGOPEPTIDASE M) [Rattus norvegicus] (SEQ m No'62) 
S) ^m67450\,h \A A^^ (AF336856) sjaiaptotagnrin VHa [Ratios norvegicus] (SEQ ID 

^^^^'^^'^ ^^^^^^-.^ I AF^^6860 J (AF336860) synaptotagnzin VHe [Rattus norvegicus] (SEQ ID 



M0V6A 1 

gi 111067375 1 1 

gi I 9055364) 1 

gi 1 2724126 1 1 

gi 1 12667450 I 1 

gil 12667458 1 1 

NOV6A 45 

. gi 1 11067375 1 45 

gi 19055364 I 45 

gil 2724126/ 60 

gil 126674501 45 

gi 1 12667458 1 45 

NOV6A 72 — 

gil 110673751 72 — 

gi I 9055364 1 72 — 

gil 2724126 1 87 — 
gil 12667450 1 
gil 126674581 

N0V6A 

gil 11067375 1 
gil 9055364 I > 
gil 2724126 1 
gi I 12667450 I 
gil 12667458 1 

NOV6A 72 

gi 1 11067375 1 72 

gil 9055364 I 72 

gif 2724126 1 87 

gi 112667450 I 147 

gil 12667458 1 225 

NOV6A 72 

gi 111067375 1 72 

gil 9055364 I 72 

gil 27241261 87 
gil 12667450 1 
gil 12667458 1 

NOV6A 105 

gil 11067375 I 105 

gil 9055364 I 105 

gil 2724126 I 120 

gi 112667450 I 222 



AGYLQEPGXXLSXXGT 



MYRDPEAASPGAPgRDVLLVSAIITVSLSVT 

myrdpeaaspgapHrdvllvsaiitvslsvt 
myrp peaaspgapar dvllvsai 1tvslsvt\ 

MYRDPj22^G3gSDVLLVSAIITVSLSVT 

myrdpeaaspgapSrdvllvsaiitvslsvt: 
m^dpeaaspgapSJrdvllvsaiitvslsvt 




ilgkrypcnsletvgtpdsgrgrgekkai 
lgkryknsletvgtpdsgrgrHekkai 
lgkryrasletvgtpdsgrgrhekkai 
lgkryknsletvgtpdsgrgrBekkai 
lgkryknsletvgtpdsgrgrHekkai 
lgkryknsletvgtpdsgrgfSekkai/, 



ngtllsg; 



72 

^ _ 72 

. 72 

_ Q7 

/ATAAAGLAVEREGRLGEKPAPVP 104 
VATAAAGLAVEREGRLGEKPAPVP 104 



72 
72 
72 
87 



105 ppgedalrsggaapsepgssgkagrgrwrmvqshlaagklnls ZZZZZZZ i>,-7 

105 ppgedalrsggaapsepgssgkagrgrwrmvqshlaagklnlsnfedstlstattlesip 1 



72 

.72 

72 

87 

147 • 

165 



SSAGEPKCQRPRTLMRQQSLQQPLSQNQRGRQPSQPTTSQSLGQLQi 



64 



72 
72 
72 
87 

147 



IAHAASAPGSNPRAY 224 



GRGQARQGTSAGSKYRAAGGRSRSNPGSWDHWGQIRNRGLdMKSFI^^ 



72 
72 
72 
87 

KEGRMWLSLVLGL 161 
284 



162 seqddfanipdlqnpgtqqnqnaqgdkr 

285 SEQDDFANIPDLQNPGTQQNQNAQGDJ 



ilpaggkavntapvpgqtphdesdrrteprssB 
lpaggkavntapvpgqtphdesdrrteprssv 
lpaggkavntapvpgotphdesdrrteHrssv 
lpaggkavntapvpgqtphdesdrrteprssv 
lpaggkavntapvpgqtphdesdrrteprssv 
gglpaggkavntapvpgqtphdesdrrteprssv 



104 
104 
104 
119 
221 
344 
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al 1 12667458 1 


345 


NOV6A 


134 


gl 1110673751 


165 


gl 1 9055364 1 


165 


gl 127241261 


180 


al 1 126674501 




gl 1126674581 


405 


NOV6A 


X 7 *x 


gl 1 11067375 1 


99^ 


gi 190553641 


225 


gi|2724126| 


24 0 


ai 1 12667450 1 




ai 1 12667458 1 


Do 


NOV6A 




ai 1 11067375 1 


^ o o 


gi 190553641 


285 


gl 127241261 


300 


ai 1 12667450 1 




ai 1 12667458 1 




NOV6A 


"5 1 "5 

JU. .3 


ai 1 11067375 1 


"5 /I 
J fi O 


gi 19055364 1 


345 


gi 127241261 


360 


gl 1126674501 


4 62 


glU2667458| 


585 
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SDLVNSLTSEMLMLSPGSEEDEAHEGCSRENLGRIQFSVGYNFOESTLTVK 



v l>Mj^iWatjl5g! 404 



KDFSGTSDPEVKIYLLPDKKHKLETKVKRKNLNPHWNETFLFEGFPYEKVVQRILYLQVL 
KDFSGTSDPFVKIYLLPDKKHKLETKVKRKNLNPHWNETFLFEGFPYEKVVQRILYLQVL 

kdfsgtsdpfvkiyllpdkkhkletkvkrknlnphwnetflfegfpyekwqrBlylqvlI 
kdfsgtsdpfvkiyllpdkkhklStkvkrknlnphwnetflfegfpyekwqrilylqvlI 
kdfsgtsdpfvkiyllpdkkhkletkvkrknlnphwnetflfegfpyeicvvqrilylqvl 

KDFSGTSDPFVKIYLLPDKKHKLETKVKRKNLMPHWMETFLFEGFPYEKVVORILYLOVLi 



DYDRFSRaoPIGEVSIPLl^VDLTQMQaUWKDLKPCSDGSGSRGELLLSLCYNPSANSII 
DYDRFSRNDPIGEVSIPLNKVDLTQMQTFWKDLKPCSDGSGSRGELLLSLCYNPSANSII 
DYDRFSRNDPIGEVSIPLNKVDLTQMQTFWKDLKPCSDGSGSRGELLLSLCYlxIPSANSII 
DYDRFSRNDPIGEVSIPLNKVDLTQMQTFtJKDLKPCSDGSGSRGELLLSLCYNPSANSII 
DYDRFSRNDPIGEVSIPLNKVDLTQMQTFWKDLKPCSDGSGSRGELLLSLCYNPSAHSII 
DYDRFSRNDPIGEVSIPLWKVDLTQMQTFWKDLKPCSDGSGSRGELLLSLCYNPSANSIll 



VNIIKARNLrtflMDIGGTSDPYVKVWLMYKDKRVEKKKTVTBKRNLNPIFNESFAFDIPTE 
VNIIKARNLKAMDIGGTSDPYVIWWLMYKDKRVEKKKTVTgKRNLNPIFNESFAFDIPTE 

vniikarnlkamdiggtsdpyvkvwlmykdkrvei<kktvtBkrnlnpifnesfafdipte 

VNIIKJUy^LI^AMDIGGTSDPYVKVWLMYKDKRVEI^KKTVTBKRNLNPiSFNESFAFDIPTE 

vniikarnlkamdiggtsdpyvkvwlmykdkrvekkktvtBkrnlnpifnesfafdipteI 

VNIIKARNLKAMDIGGTSDPYVramT.Mvr<rnK-D\7irTftt-tri7'rHr^r,.,TT „,T^-r — , „ 



193 
224 
224 
239 
341 
464 

252 
284 
284 
299 
401 
524 

312 
344 
344 
359 
461 
584 



KLRETTIIITVMDKDKLSRNDVIGKIYLSWKSGPGEVKHWKDMIARPRQPVAQWHQLKA 
KLRETTIIITVMDKDKLSRNDVIGKIYLSWKSGPGEVKHWKDMIARPRQPVAQWHQLKA 
!!i'??r"^^™°^'^^^^'^°^^'^^^^^S"KSGPGEVKHWKDMIARPRQPVAQWHQLI<A 

klrettiiittodkdklsrndvigkiylswksgpgevEhwkdmiarprqpvaqwhqlka 

,^^'^5^J?^?^^°K^I<I'SRNDVIGKIYLSWKSGPGEVKHWKDI«IARPRQPVAQWHQLKA 
KLR^TTIIITVMDKDKLSRNDV-IGKrYLSWKSGPGEVKHW KDMIARPROPVAOWHnT.KAj 



Tables 6H-6K list the domain descriptions from DOMAIN analysis results against 
NOV6a. This indicates that the NOV6a sequence has properties similar to those of other 
proteins known to contain this domain. 
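
The domain tables that follow report a domain length together with a "% aligned" figure. One plausible reading, offered here as an assumption rather than a definition given in the text, is that the figure is the share of the domain model spanned by the alignment. The sketch below (function name hypothetical) computes that share from the first and last domain positions shown in an alignment.

```python
def percent_aligned(domain_start, domain_end, domain_length):
    """Share of a domain model covered by an alignment, to one decimal.

    domain_start and domain_end are the 1-based, inclusive positions of
    the domain model that appear in the alignment.
    """
    covered = domain_end - domain_start + 1
    return round(100.0 * covered / domain_length, 1)


print(percent_aligned(1, 100, 101))  # smart00239, positions 1-100 of 101 (Table 6H)
print(percent_aligned(1, 87, 88))    # pfam00168, positions 1-87 of 88 (Table 6J)
print(percent_aligned(1, 85, 88))    # pfam00168, positions 1-85 of 88 (Table 6K)
```

Under this reading the three values come out as 99.0, 98.9 and 96.6, matching the percentages printed in Tables 6H, 6J and 6K.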



371 
403 
403 
418 
520 
643 



Table 6H. Domain Analysis of NOV6a 

gnl|Smart|smart00239, C2, Protein kinase C conserved region 2 (CalB); 
Ca2+-binding motif present in phospholipases, protein kinases C, and 
synaptotagmins (among others). Some do not appear to contain Ca2+-
binding sites. Particular C2s appear to bind phospholipids, inositol 
polyphosphates, and intracellular proteins. Unusual occurrence in 
perforin. Synaptotagmin and PLC C2s are permuted in sequence with 
respect to N- and C-terminal beta strands. SMART detects C2 domains 
using one or both of two profiles. (SEQ ID NO:94) 

Length = 101 residues, 99.0% aligned 
Score = 103 bits (258), Expect = 1e-23 



N0V6a 120 

00239 a 

NOV6a 180 

00239 . 61. 



TLTVKIMKAQELPAK^ 179 

llll+ll+I Ml Mill M 1 

TLTVKIISARNLPPKDKGGKSDPyVKVSLDGDPREKKKTKN^TL^ ' 60 



YEKWQRILYLQVLDYDRFSRHDPIGEVSIPLKQVDLTQMQIW 
i I ++! I I I I I I I ! I 1 + 1 I I +1 

PE-.rr.LSELEIEVYDKDRFSRDDFIGRVTIPLSDLLLGGRHEK 



222 



100 



Table 6I. Domain Analysis of NOV6a 

gnl|Smart|smart00239 (SEQ ID NO:94) 
Length = 101 residues, 96.0% aligned 
Score = 91.3 bits (225), Expect = 9e-20 



NOVSa ' 53 



LETVGTPDSGRGRSEKKAIKLPAGGKAVNTAPVPGQTPHDESDRRTEPRSSFSDLVNSLT 112 

49 
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00239 


296 


N0V6a 


113 


00239 


353 


NOV6a 


142 


00239 


413 


NOV6a 


202 


00239 


473 


N0V6a 


261 


00239 


533 


N0V6a 


321 


00239 


593 



1+11+'+ +1— +! I 1 I 1 i I I I I It 1 1 I I ) I 1 I i 11 M 1 11 111. il.lJ 1 t 1 I 1. . 
LQNPGTQQNQNAQGDK RLPAGGKAVNTAPVPGQTPHDESDRRTEPRSSVSDLVNSLT 352 

SEMIiM ESTLTVKIMKAQELPAKDFSGTSD 141 

Mill Illllll + IIMIIIIIIIIIIII . 

SEMLMLSPGSEEDEAHEGCSRENLGRIQFSVGYNFQESTLTVKVMKAQELPAKDFSGTSD 41*2 
PPVKI YLLPDKKHKLETKVKRKNLNPHWNETFLFEGFPYEKVVQRIIiYLQVLDYDRF 201 

I li 11 1 1 1 1 1 1! 1 1 1 1 1! I n 1 1 1 1 1 II 1 1 II 1 1 n II 1 11 11 1 1 1 11 n II 1 1 1 1 1 M+ 

PFVKIYLLPDKKHKLETKVKRKNLNPHWNETFLFEGFPYEKVVQRILYLQVLDYDRFSR'N 472 
DPIGEVSIPLKQVDLTQMQ-IWKDLKPCSDGSGSRGELLLSLC^PSANSIIVNIIKARN 260 

II M I N M i +M 11 N i i II 11 I M I I II M II tl M 11 { 1! I M I 11 ! II ( (1 M 

DPIGEVSIPLNKVDLTQMQTFWKDLKPCSDGSGSRGELLLSLCYNPSANSIIVNIIKARN 532 
LKAMDIGGTSDPYVKVWLMYKDKRVEKKKTVTMKRNIJyJPIFNESFAF 320 

IlinilllllllllllllinillllllMI IMMIMllllllMlllllllllll 

LKAMDIGGTSDPYVKVWLtryKDKRVEKKKTVTKKRNLNPIFNESFAFDIOT 592 

ITVMDKDKLSRNDVIGKIYLSWKSGPGEVKHWKDMIARPRQPVAQWHQLKA 371 
) I 1 ! I I II 1 II III ) I I) I i I I ) I I I I 1 )) 1 I I II I M i II I II 1 I t I I II 
ITVMDKDKLSRNDVIGKIYLSWKSGPGEVKHWKDMIARPRQPVAQWHQLKA 643 



Table 6J. Domain Analysis of NOV6a 

gnl|Pfam|pfam00168, C2, C2 domain. (SEQ ID NO:95) 
Length = 88 residues, 98.9% aligned 
Score = 98.6 bits (244), Expect = 6e-22 



N0V6a 121 I.TVKIMKAC2ELPAKDFSGTSDPFVKIYLLPDKK — HKLETKVKRKNLNPHWNETFLFEGF 178 

1111++ 1+ II I +1 111+11+ III ! +11 +1 IN lilll+ll . 
00168 1 LTVKVISARNLPKMDMNGLSDPYVKVDLDGDPKDTEOKFECTKTVKKTLNPVWNETPVFEK^ 60 

N0V6a 179 PYEKWQRILYLQVLDYDRFSRHDPIGEV 207 

• I I I I M I I I I I l + l 

00168 61 PLPD — LASLRFAVYDEDRFSRDDFIGQV 87 



Table 6K. Domain Analysis of NOV6a

gnl|Pfam|pfam00168, C2, C2 domain. (SEQ ID NO:95)
Length = 88 residues, 96.6% aligned
Score = 88.6 bits (218), Expect = 6e-19

NOV6a 251 IIVNIIKARNLKAMDIGGTSDPYVKVWLMYKDKRVEKKKTVTMKRNLNPIFNESFAFD--I 309
00168   1 LTVKVISARNLPKMDMNGLSDPYVKVDLDGDPKDTKKFKTKTVKKTLNPVWNETFVFEKV 60

NOV6a 310 PTEKLRETTIIITVMDKDKLSRNDVIG 336
00168  61 PLPDLAS--LRFAVYDEDRFSRDDFIG 85





Synaptotagmins are a family of brain-specific calcium-dependent phospholipid-binding
proteins that play a role in synaptic exocytosis and neurotransmitter release. While
constructing a transcript map of the human chromosomal 11q13 interval associated with Best
vitelliform macular dystrophy, Cooper et al. isolated cDNAs encoding the human homolog of
rat synaptotagmin VII (Cooper et al., Genomics 49:419-429, 1998). The predicted 403-amino
acid human and rat proteins are 98% identical. Northern blot analysis revealed that
synaptotagmin VII is expressed as 4.4- and 7.5-kb mRNAs in a variety of human adult and
fetal tissues, including those from different regions of the brain.

Neurons release neurotransmitters by calcium-dependent exocytosis of synaptic
vesicles. Brose et al. reported that synaptotagmin, a highly conserved synaptic vesicle protein,
binds calcium at physiological concentrations in a complex with negatively charged
phospholipids (Brose et al., Science 256:1021-1025, 1992). This binding is specific for
calcium and involves the cytoplasmic domain of synaptotagmin. Calcium binding is dependent
on the intact oligomeric structure of synaptotagmin; it is abolished by proteolytic cleavage at a
single site. Brose et al. (1992) interpreted the results as suggesting that synaptotagmin acts as a
cooperative calcium receptor in exocytosis. Synaptotagmin contains 2 copies of a sequence
that is homologous to the regulatory region of protein kinase C. Perin et al. characterized
full-length cDNAs encoding human and Drosophila synaptotagmins (Perin et al., Nature
345:260-263, 1991). Similarity of the phospholipid binding properties of the cytoplasmic
domains of rat, human, and Drosophila synaptotagmins and selective conservation of the
sequences that are homologous to protein kinase C suggested that these may be involved in
phospholipid binding.

The above defined information for NOV6 suggests that NOV6 may function as a
member of a synaptotagmin family. Therefore, the NOV6 nucleic acids and proteins of the
invention are useful in potential therapeutic applications implicated in various diseases and
disorders described below and/or other pathologies. For example, the NOV6 compositions of
the present invention will have efficacy for treatment of patients suffering from Atopy;
Osteoporosis-pseudoglioma syndrome; Smith-Lemli-Opitz syndrome, type I; Smith-Lemli-Opitz
syndrome, type II; Xeroderma pigmentosum, group E, subtype 2; Asthma, atopic,
susceptibility to; Diabetes mellitus, insulin-dependent, 4; Susceptibility to IDDM;
Angioedema, hereditary; Paraganglioma, familial nonchromaffin, 2; Von Hippel-Lindau
(VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, hypercalcemia, Parkinson's
disease, Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple
sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, anxiety,
pain, neuroprotection, metabolic disorders and Lambert-Eaton myasthenic syndrome. The
NOV6 nucleic acid encoding synaptotagmin-like protein, and the synaptotagmin-like protein
of the invention, or fragments thereof, may further be useful in diagnostic applications,
wherein the presence or amount of the nucleic acid or the protein is to be assessed.
NOV7

A disclosed NOV7 nucleic acid of 1164 nucleotides
(wugc_draflji_nh0781m21_20000809_dal) encoding a novel Serine Protease TLSP-like
receptor protein is shown in Table 7A. An open reading frame was identified beginning with
an ATG initiation codon at nucleotides 113-115 and ending with a TAG codon at nucleotides
854-856. Putative untranslated regions are found upstream from the initiation codon and
downstream from the termination codon in Table 7A, and the start and stop codons are in bold
letters.
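
The codon coordinates quoted for an open reading frame can be checked against the stated polypeptide length with simple arithmetic. The following is a minimal sketch, not part of the disclosure; the function name `orf_summary` is hypothetical, and it assumes 1-based inclusive coordinates with the stop codon counted inside the reading frame but not translated.

```python
def orf_summary(start_first: int, stop_last: int) -> dict:
    """Summarize an ORF given 1-based, inclusive nucleotide coordinates.

    start_first: position of the first base of the initiation codon
    stop_last:   position of the last base of the termination codon
    """
    length_nt = stop_last - start_first + 1
    if length_nt <= 0 or length_nt % 3 != 0:
        raise ValueError("coordinates do not span a whole number of codons")
    codons = length_nt // 3
    # The termination codon is not translated, so residues = codons - 1.
    return {"length_nt": length_nt, "codons": codons, "residues": codons - 1}


# NOV7: ATG at 113-115, TAG at 854-856
print(orf_summary(113, 856))
```

For NOV7 this gives 744 nucleotides and 247 residues, which agrees with the 247 amino acid polypeptide described for SEQ ID NO:18.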



Table 7A. NOV7 Nucleotide Sequence (SEQ ID NO:17) 



CTGCCTT6CTCCACACCTGGTCAGGGGAGA6AGGG6AGGAAAGCCAAG6GAAGGGACCTAACTGAAAACAA 
ACAAGCTGGGAGAAGCAGGAATCTGCGCTCGGGTTCCGCAGA TGCAGAGGTTGAGGTGGCTGCIGGCArTCifi 

AAGTCATCGGGCAGAGGTCTCACAGCAGCCAAGGAACCTGGGGCCCGCTCCTCCCCCCTCCAGGCCATGAG 

GATTCTGCAGTTAATCCTGCTTGCTCTGGCAACAGGGCTTGTAGGGGGAGAGACCAGGATCATCAAGGGGT 

TCGAGTGCAAGCCTCACTCCCAGCCCTGGCAGGCAGCCGTGTTCGAGAAGACGCGGCTACTCTGTGGGGCG 

ACGCTCATCGCCCCCAGATGGCTCCTGACAGCAGCCCACTGCCTCAAGCCCCTCCCCAACAAAGACCGCCG 

CAATGACATCATGCTGGT6AAGATG6CATCGCCAGTCTCCATCACCTGGGCTGTGCGACCCCTCACCCTCT 

CCTCACGCTGTGTCACTGCTGGCACCAGCTGCCTCATTTCCGGCTGGGGCAGCACGTCCAGCCCCCAGTTA 

CGCCTGCCTCACACCTTGCGATGCGCCAACATCACCATCATTGAGCACmGAAGTGTGAGAACGCC 

CGGCAACATCACAGACACCATGGTGTGTGCCAGGGTGCAGGAAGGGGGCAAGGACTCCTGCCAGGGTGACT 

CCGGGGGCCCTCTGGTCTGTAACCAGTCTCTTCAAGGCATTATCTCCTGGGGCCAGGATCCGTGTGCGATC 

ACCCGAAAGCCTGGTGTCTACACGAAAGTCTGCAAATATGTGGTCTGGATCCAGGAGACGATTAAGAACAA 

TTA GGCTGGACCCACCCACCACAGCCCATCACCCTCCATTTCCACTTGGTGTTTGGTTCCTGTTCACTCTG 

TTAATAAG AAACCCTAA6CCAAGACCCTCTACGAACATTCTTTGGGCCTCCT6GACTACAGGAGATGCTGT 

CACTTAAT AATCAACCTGGGGTTCGAAATCAGTGAGACCTGGATTCAAATTCTGCCTTGAAATATTGTGAC 

TCTGGGAATGACAACACC TGGTTTGTTTTTTGTTGTATCCCCAGCCCCAAAGACAG CTCCTGGCCATATAT 

CAAGGTTTCAATAAATATTTGCTAAATG " 



The disclosed NOV7 nucleic acid sequence, localized to chromosome 19, has 531 of
607 bases (87%) identical to a Homo sapiens trypsin-like serine protease (TLSP) mRNA
(gb:GENBANK-ID:AF164623|acc:AF164623) (E = 1.3e-^^^).
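
The identity percentages quoted throughout this section follow from the ratio of matching positions to aligned positions. The sketch below is illustrative only; `percent_identity` is a hypothetical helper, and the use of truncation rather than rounding is an assumption inferred from figures in this document (for example, 244/282 is reported as 86%), not a documented property of the alignment software.

```python
def percent_identity(matches: int, aligned: int) -> int:
    """Whole-number percent identity, truncated toward zero (assumed convention)."""
    if aligned <= 0:
        raise ValueError("aligned length must be positive")
    return (100 * matches) // aligned


# NOV7 nucleic acid versus TLSP mRNA: 531 of 607 bases
print(percent_identity(531, 607))
# NOV7 protein versus TLSP protein in Table 7C: 244 of 282 residues
print(percent_identity(244, 282))
```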

A disclosed NOV7 polypeptide (SEQ ID NO:18) encoded by SEQ ID NO:17 is 247
amino acid residues and is presented using the one-letter amino acid code in Table 7B. SignalP,
Psort and/or Hydropathy results predict that NOV7 contains a signal peptide and is likely to
be localized in the mitochondrial inner membrane with a certainty of 0.6921 and to the plasma
membrane with a certainty of 0.6500. The most likely cleavage site for a NOV7 peptide is
between amino acids 50 and 51, at: VGG-ET.



Table 7B. Encoded NOV7 protein sequence (SEQ ID NO:18).



MQRLRWLRDWKSSGRGLTAAKEPGARSSPLQAMRILQLILLALATGLVGGETRIIKGFECKPHSQPWQAAL
FEKTRLLCGATLIAPRWLLTAAHCLKPLPNKDRRNDIMLVKMASPVSITWAVRPLTLSSRCVTAGTSCLIS
GWGSTSSPQLRLPHTLRCANITIIEHQKCENAYPGNITDTMVCASVQEGGKDSCQGDSGGPLVCNQSLQGI
ISWGQDPCAITRKPGVYTKVCKYVVWIQETIKNN



The NOV7 amino acid sequence has 146 of 149 amino acid residues (97%) identical
to, and 147 of 149 amino acid residues (98%) similar to, the Homo sapiens 282 amino acid
residue serine protease (TLSP) protein (ptnr:SPTREMBL-ACC:O75837) (E = 5.2e''^^^).

NOV7 is a spliced isoform of the serine protease (TLSP) from Homo sapiens
(GenBank ID: AB012917). It is missing 105 nucleotides between positions 406 and 407.
Deletion of this exon resulted in a deletion of 35 amino acid residues between positions 98 and
99 in the protein sequence.
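
The relationship between the 105-nucleotide deletion and the 35-residue deletion can be verified directly, because a deletion preserves the reading frame only when its length is a multiple of three. This is a small illustrative sketch; the function name `in_frame_deletion` is hypothetical.

```python
def in_frame_deletion(deleted_nt: int, reference_aa: int) -> int:
    """Return the isoform length in residues after an in-frame deletion."""
    if deleted_nt % 3 != 0:
        raise ValueError("deletion is not a multiple of 3; the reading frame would shift")
    return reference_aa - deleted_nt // 3


# TLSP is 282 residues; NOV7 lacks a 105-nucleotide exon.
print(in_frame_deletion(105, 282))
```

The result, 247 residues, matches the length given for the NOV7 polypeptide.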

NOV7 is expressed in at least the following tissues: Colon, Heart, Lung, Ovary,
Parotid Salivary glands, Prostate, Salivary Glands, Stomach (normal), Stomach (poorly
differentiated adenocarcinoma with signet ring cell), Testis and Uterus. In addition, the
sequence is predicted to be expressed in the following tissues/cell lines because of the
expression pattern of a closely related Homo sapiens trypsin-like serine protease (TLSP) gene
homolog (GENBANK-ID: gb:GENBANK-ID:AF164623|acc:AF164623): brain, thymus,
spleen, liver and in breast carcinoma cell line BT-474.

NOV7 also has homology to the amino acid sequence shown in the BLASTP data
listed in Table 7C.



Table 7C. BLAST results for NOV7

Gene Index/Identifier                  Protein/Organism                                   Length (aa)  Identity (%)   Positives (%)  Expect
gi|3649791|dbj|BAA33404.1| (AB012917)  serine protease (TLSP) [Homo sapiens]              282          244/282 (86%)  245/282 (86%)  1e-124
gi|5803199|ref|NP_006844.1|            kallikrein 11; protease, serine, trypsin-like;     250          212/250 (84%)  213/250 (84%)  1e-107
                                       protease, serine, 20 trypsin-like [Homo sapiens]
gi|6681654|dbj|BAA36955.1| (AB016227)  hippostasin prostate type [Mus musculus]           276          191/282 (67%)  214/282 (75%)  1e-101
gi|9910298|ref|NP_064358.1|            protease, serine, 20; hippostasin [Mus musculus]   249          175/248 (70%)  194/248 (77%)  5e-96
gi|9296988|sp|Q9UKQ9|KLK9_HUMAN        kallikrein 9 precursor (kallikrein-like protein    250          117/242 (48%)  152/242 (62%)  7e-57
                                       3) (KLK-L3) [Homo sapiens]



The homology of these sequences is shown graphically in the ClustalW analysis shown
in Table 7D.

Table 7D. Information for the ClustalW proteins

1) NOV7 (SEQ ID NO:18)

2) gi|3649791|dbj|BAA33404.1| (AB012917) serine protease (TLSP) [Homo sapiens] (SEQ ID NO:65)

4) gi|6681654|dbj|BAA36955.1| (AB016227) hippostasin prostate type [Mus musculus] (SEQ ID NO:67)

5) gi|9910298|ref|NP_064358.1| protease, serine, 20; hippostasin [Mus musculus] (SEQ ID NO:68)

6) gi|9296988|sp|Q9UKQ9|KLK9_HUMAN kallikrein 9 precursor (kallikrein-like protein 3) (KLK-L3) [Homo sapiens] (SEQ ID NO:69)

NOV7

gi I 3649791 1 
gi I 5803199 1 
git 66816541 
gi I 9910298 1 
g± I 9296988 1 

NOV7 

gi I 3649791 1 
gi 15803199 I 
gi I 6681654 1 
gi I 9910298 1 
gi I 9296988 1 

NOV7 

gi I 3649791 1 
gi 15803199 I 
gi) 6681654 J 
gi I 99102981 
gi I 92969881 

NOV7 

gi|3649791| 
gi I 58031991 
gi I 66816541 
gi I 9910298 1 
gi I 9296988 1 

NOV7 

gi 1 3649791 1 
gi 1 5803199 1 
gi I 66816541 
gi I 9910298 1 
gi I 9296988 1 



1 
1 
1 
1 
1 
1 

61 
61 
29 
55 
28 
30 

98 

121 

89 

115 

88 

90 



MQRLRWLRDWKS SGRGLTAAKEPGARS SPLQAMR 
MQRLRWLRDWKSSGRGLTAAKEPGARSSPLQAMR 

MRI 

MRRLKSDWKLSTETREPGARPALLQARM 



-MKLG 



PHSQPWQAALFgKTRLLCGATLlAPRWLLTAAHCfflKPB 
pHSQPWQAALFgKTRLLCGATLIAPRWLLTAAHCgKpiY 
jPHSQPWQAALEgECrRLLCGATLIAPRWLLTAAHcHKplY 
PHSQPWQgALFgKTRLLCGATLIAPjJWLLTAAHclKpSY 
PHSQPWOBALFfflKTRLLCGATLIAPgWLLTAAHci^ 
,P{3SQPWQABLFiS3TRLBCGATLl^RWLLTAARPaPrPr 



^Mi^^^^^aL.PNKDjgRNDIMLVKMgSPV>a^<Tii 

ATESFPHPgFNNSLPNKDHRNDlMLVKMBs PV^T* 

atesfphpHfnnslpnkdhrndimlvkmSspv SSt 

ATESFPHPgFNNSLPNKDHRNDlMLVKMfisPV St 
ATES FPHPffl FNNSLPNKDHRNDlMLV^is PvSaTtl 
BTjggFPHpgFNagLglMnHimnTMTJ 




SAVgPLTLS 
llAVgPLTLS 

avSpltls 

SpIiTLS 
JAvfflpLTLS 

avHplISls 




146 'S 
181 £ 

149 -S 
175 p: 
148 m 

150 y 



T S S PQLRL PHTLRC AN I 
TSSPQLRLPHTLRCANI 
T S S PQLRLPHTLRCANI 
T S S PQLRL PHgLRCAN 
T S S P QLRL PhIlRCAnI 
SSPl^JSPffl^^SCANl 



IIEHggCEj|AYPGNITDTMgCASVgEGGKDSCQGDSGSPLV 
IIEHggcEgAYPGNITDTMgCASvffiEGGKDSCQGDSGGPLV 
IIEHSScE|AYPGNITD™HcASvffiEGGKDSCQGDSGGPLv' 
IIEHgCElAYPGNITDTMlCASvSggGKDSCQGDSGGPLV 
■''n^S^Si^^^^^-^'^^'^^ JCASVraGKDSCQGDSGGPLV 
^gES^CgglAYPGgllgDaM ic^^^GmsCQGDSGGPhV 



205 
240 
208 
234 
207 
209 




Tables 7E and 7F list the domain description from DOMAIN analysis results against
NOV7. This indicates that the NOV7 sequence has properties similar to those of other
proteins known to contain this domain.

Table 7E. Domain Analysis of NOV7

gnl|Smart|smart00020, Tryp_SPc, Trypsin-like serine protease; Many of
these are synthesised as inactive precursor zymogens that are cleaved
during limited proteolysis to generate their active forms. A few,
however, are active as single chain molecules, and others are inactive
due to substitutions of the catalytic triad residues. (SEQ ID NO:96)
Length = 230 residues, 100.0% aligned
Score = 210 bits (535), Expect = 7e-56



NOV7 

00020 

NOV7 

00020 

N0V7 

00020 

NOV7 

00020 



53 
1 

106 

61 

136 

121 

194 

181 



RIIKGFECKPHSQPWQAALF-EKTRLLCGM 

" ' ' ' I- I M +1 I '1 I +1 l + l I l + I I 1 M 1+ 1 

RIVGGSEi^lGSFPWQVSLQYRGGIUIFCGGSLISPRWLTAAfiCVYGSAPSSIRVRLGSH 



NDIMLVKMAS PVS ITWAVRPLTL— S SRCVTA 
DLSSGEETQTVKVSKVIVHPNYNPSTyoiiiAiiilLp^^^ 

''''''''' I I I 1+ 1+ 1+ + 1 III I M i + i I nil 

GTTCTVSGWGRTSESSGSLPDTLQEVNVPIVSNATCRRAYSGGPAITDNMLC^^ 

DSCQGDSGGPLVCNQS LQGIISWGQDPCAITRKPGVYTKVCKYWWI 240 

J+M I II 1 I I 1 I I I I I 1 + 1 II 11 -II 11 I I + I 1+ I I ' ' ' 

DACQGDSGGPLVCNDPRWVLVGIVSWGSYGCARPNKPGVYTRVSSY^ 230 



105 
60 

135; 

120 
193 
180

Table 7F. Domain Analysis of NOV7

gnl|Pfam|pfam00089, trypsin, Trypsin. Proteins recognized include all
proteins in families S1, S2A, S2B, S2C, and S5 in the classification
of peptidases. Also included are proteins that are clearly members,
but that lack peptidase activity, such as haptoglobin and protein Z
(PRTZ*). (SEQ ID NO:97)
Length = 217 residues, 100.0% aligned
Score = 172 bits (435), Expect = 3e-44



N0V7 


54 


00089 


1 


NOV7 


98 


00089 


61 


N0V7 " 


141 


00089 


121 


N0V7 


201 


00089 


178 



IIKGFECKPHSQPWQAALFEKTRLLCGATLIAPRWLLTAAHCLK 97 

i+ I I + I III +1 + 11 +11+ I + 1II1M + 

IVGGREAQAGSFPWQVSLQVSSGHFCGGSIiISENWVLTAAHCVSGASSVRWLGEHNLGT 60 

'• PLPNKD-RRNDIMLVKMASPVSITWAVRPLTLSSRC — VTAGTSCL 140 

.11+ III 1 + 1+ I II++ 1 11+ I I . • + I l + I 
TEGTEQKFDVKKIIVHPNYNPDTNDIALLKLKSPVTLGDTVRPICLPSASSDLPVGTTCS 120 

ISGWGSTSSPQLRLPHTLRCANITIIEHQKCENAYPGNITDTMVCASVQEGGKDSCQGDS 200 

-♦•iiii .1+1 11+ + 1+ + 1 +11 I +1111+11 Mii+iirii 



GGPLVC-NQSLQGIISWGQDPCAITRKPGVYTKVCKYWWI 240 
I I M II + I I l + I I I 11+ I I 1 I l + l +1+ 1 I 

GGPLVCSDGELVGIVSWG- YGCAVGNYPGVYTRVSRYLDWI 217 



The amino acid sequence of NOV7 has high homology to other proteins as shown in 
Table 7G. 



Table 7G. BLASTX results for NOV7

                                                                                     Reading  High   Smallest Sum
Sequences producing High-scoring Segment Pairs:                                      Frame    Score  Prob P(N)     N
patp:AAY42439 CASB12 amino acid sequence, Homo sapiens, 282 aa                       +2       792    3.0e-130      1
patp:AAB11712 Human serine protease BSSP6, Homo sapiens, 282 aa                      +2       792    3.0e-130      1
patp:AAY43636 Human prostate-associated serum protease, Homo sapiens, 282 aa         +2       792    3.0e-130      1



The trypsin family is almost totally confined to animals, although trypsin-like enzymes
are found in actinomycetes of the genera Streptomyces and Saccharopolyspora, and in the
fungus Fusarium oxysporum. The enzymes are inherently secreted, being synthesised with a
signal peptide that targets them to the secretory pathway. Animal enzymes are either secreted
directly, packaged into vesicles for regulated secretion, or are retained in leukocyte granules.

Proteases play a pivotal role in several biologic processes, including tissue remodeling
and cell migration. By PCR of human hippocampus cDNA using primers derived from mouse
neuropsin cDNA sequences corresponding to conserved regions of serine proteases, a novel
serine protease, KLK11, was identified which was named TLSP. The deduced 260-amino acid
protein contains a signal peptide, 3 key amino acids essential for serine protease activity, an
asp residue in a position that suggests a trypsin-type substrate specificity for basic amino acids
at the P1 position, conserved amino acids that can form an oxyanion hole, and a potential
N-glycosylation site. KLK11 shares 48% amino acid sequence identity with mouse neuropsin,
43% identity with both human trypsin-1 and human kallikrein, and 38% identity with the
mouse nerve growth factor gamma subunit. Western blot analysis of recombinant KLK11
suggested that the protein is secreted and posttranslationally processed.

Proteolytic enzymes have been readily used in traditional medicine and studies have
shown that enzyme therapy can reduce the adverse effects caused by radiotherapy and
chemotherapy. There is also evidence that, in some types of tumours, survival may be
prolonged. The beneficial effect of systemic enzyme therapy seems to be based on its
anti-inflammatory potential (Leipner and Saller, Drugs 59(4):769-80, 2000).

The above defined information for NOV7 suggests that this NOV7 protein may
function as a member of a Serine Protease TLSP family. Therefore, the NOV7 nucleic acids
and proteins of the invention are useful in potential therapeutic applications implicated in
various diseases and disorders described below and/or other pathologies. For example, the
NOV7 compositions of the present invention will have efficacy for treatment of patients
suffering from cancer, neurological disorders, digestive system disorders and all or some of
the protease/protease inhibitor deficiency disorders. The NOV7 nucleic acid encoding Serine
Protease TLSP-like protein, and the Serine Protease TLSP-like protein of the invention, or
fragments thereof, may further be useful in diagnostic applications, wherein the presence or
amount of the nucleic acid or the protein is to be assessed.
NOV8

NOV8 includes four novel Glypican-2 Precursor-like proteins disclosed below. The
disclosed proteins have been named NOV8a, NOV8b, NOV8c and NOV8d.

NOV8a

A disclosed NOV8a nucleic acid of 1785 nucleotides (also referred to as
134913441_EXT) encoding a novel Glypican-2 Precursor-like protein is shown in Table 8A.
An open reading frame was identified beginning with an ATG initiation codon at nucleotides
1-3 and ending with a TAA codon at nucleotides 1738-1740. A putative untranslated region
downstream from the termination codon is underlined in Table 8A, and the start and stop
codons are in bold letters.



Table 8A. NOV8a Nucleotide Sequence (SEQ ID NO:19)



TCTGGTTCACACACTGGCTGCCAGGCACAGAAAATTTGATGAGTTTTTTCTGGAGATGCTCTCAGTAGCCC
AGCACTCTCTGACCCAGCTCTTCTCCCACTCCTACGGCCGCCTGTATGCCCAGCACGCCCTCATATTCAAT

GGCCTGTTCTCTCGGCTGCiGAGACTTCTATGGGGAATCTGGTGAGGGGTTGGATGACACCCTGGCGGATTT 

CTGGGCACAGCTCCTGGAGAGAGTGTTCCCGCTGCTGCACCCACAGTACAGCTTCCCCCCTGACTACCTGC 

TCTGCCTCTCACGCTTGGCCTCATCTACCGATGGCTCTCTGCAGCCCTTTGGGGACTCACCCCGCC6CCTC 

CGCCTGCAGATAACCCGGACCCTGGTGGCTGCCCGAGCCTTTGTGCAGGGCCTGGAGACTGGAAGAAATGT 

GGTCAGCGAAGCGCTTAAGGTTCCGGTGTCTGAAGGCrGCAGCCAGGCTCTGATGCGTCTCATCGGCTGTC 

CCCTGTGCCGGGGGGTCCCCTCACTTATGCCCTGCCAGGGCTTCTGCCTCAACGTGGTTCGTGGCTGTCTC 

AGCAGCAGGGGACTGGAGCCTGACTGGGGCAACTATCTGGATGGTCTCCTGATCCTGGCTGATAAGCTCCA 

GGGCCCCTTTTCCTTTGAGCT6ACGGCCGAGTCCATTGGGGTGAAGATCTCGGAGGGTTT6ATGTACCTGC 

AGGAAAACAGTGCGAAGGTGTCCGCCCAG6TATTTCAGGAGTGCGGCCCCCCCGACCCG6TGCCTGCCCGC 

AACCGTCGAGCCCCGCCGCCCCGGGAAGAGGCGGGCCGGC*GTGGTCGATGGTGACCGAGGAGGAGCGGCC 

AACGACC6CCGCAGGCACCAACCTGCACC6GCTGGTGTGGGAGCTCCGCGAGCGTCTGGCCCGGATGCGGG 

GCTTCTGGGCCCGGCTGTCCCTGACGGTGTGCGGAGACTCTCGCATGGCAGCGGACGCCTCGCTGGAGGCG 

GCGCCCT6CT6GACCG6AGCCGGGCGGGGCCGGTACTTGCCGCCAGTGGTCGGGGGCTCCCCGGCCGAGCA 

GGTCAACAACCCCGAGCTCAAGGTGGACGCCTCGGGCCCCGATGTCCCGACACGGCGGCGTCGGCTACAGC 

TCCGGGCGGCCACGGCCAGAATGAAAACGGCCGCACTGGGACACGACCTGGACGGGCAGGACGCAGATGAG 

GATGCCAGCGGCTCTGGAGGGGGACAGCAGTATGCAGATGACTGGATGGCTGGGGCTGTGGCTCCCCCAGC 

CCGGCCTCCTCGGCCTCCATACCCTCCTAGAAGG6ATGGTTCTG6GGGCAAAGGAGGAGGTGGCAGTGCCC 

GCTACAACCAGGGCCGGAGCAGGAGTGGGGGGGCATCTATTGGTTTTCACACCCARACCATCCTCATTCTC 

TCCCTCTCAGCCCTGGCCCTGCTTGGACCTCGATA ACGGGGGAGGGGTGCCCTAGCfl TrAnnnffprcTTraT 

GGCCCTTTCC 



The disclosed NOV8a nucleic acid sequence, localized to chromosome 7, has 1469 of
1785 bases (82%) identical to a Rattus norvegicus cerebroglycan mRNA (gb:GENBANK-
ID:RATCRBGLVC|acc:L20468) (E = 3.3e-2").

A disclosed NOV8a polypeptide (SEQ ID NO:20) encoded by SEQ ID NO:19 is 579
amino acid residues and is presented using the one-letter amino acid code in Table 8B. SignalP,
Psort and/or Hydropathy results predict that NOV8a contains a signal peptide and is likely
to be localized extracellularly with a certainty of 0.4467. The most likely cleavage site for a
NOV8a peptide is between amino acids 23 and 24, at: GPG-SE.



Table 8B. Encoded NOV8a protein sequence (SEQ ID NO:20).



^m^fi;^J:^:^^^*^^°^^^^^'^^^^^^^^'^T^°VL'2^SYSLNl.IPPALISGEHLRVCPQEyTCCSS 

SSRGLEPDWGNYLDGLLIIADKLQGPrSFELTAESIGVKISEGLMYLQENSAKVSAQVFQECGPPDPVPAR 
APCWTGAGRGRYLPPWGGSPAEQVNNPEIJWDASGPDVPTIWRRLQLWATAWlKTAALGH^G^s^ 



The NOV8a amino acid sequence has 477 of 579 amino acid residues (82%) identical
to, and 513 of 579 amino acid residues (88%) similar to, the Rattus norvegicus 579 amino acid
residue glypican-2 precursor (cerebroglycan) protein (ptnr:SWISSPROT-ACC:P51653) (E =
1.1e-2^).

NOV8a is expressed in at least the following tissues: Kidney, Spleen, Brain, Pediatric
pre-B cell acute lymphoblastic leukemia. This information was derived by determining the
tissue sources of the sequences that were included in the invention. SeqCalling sources:
Kidney, Spleen, Brain; Public EST sources: Pediatric pre-B cell acute lymphoblastic leukemia.
In addition, NOV8a is predicted to be expressed in brain tissues because of the expression
pattern of a closely related Rattus norvegicus cerebroglycan mRNA homolog (GENBANK-
ID:gb:GENBANK-ID:RATCRBGLVC|acc:L20468).

NOV8b

A disclosed NOV8b nucleic acid of 1976 nucleotides (also referred to as CG50970-02)
encoding a novel Glypican-2 Precursor-like protein is shown in Table 8C. An open reading
frame was identified beginning with an ATG initiation codon at nucleotides 54-56 and ending
with a TAA codon at nucleotides 1449-1451. Putative untranslated regions upstream from the
initiation codon and downstream from the termination codon are underlined in Table 8C, and the
start and stop codons are in bold letters.



Table 8C. NOV8b Nucleotide Sequence (SEQ ID NO:21) 




The disclosed NOV8b nucleic acid sequence, localized to chromosome 2q35-q37, has
1047 of 1271 bases (82%) identical to a Rattus norvegicus cerebroglycan mRNA
(gb:GENBANK-ID:RATCRBGLVC|acc:L20468.1) (E = 1.4e-24').

A disclosed NOV8b polypeptide (SEQ ID NO:22) encoded by SEQ ID NO:21 is 465
amino acid residues and is presented using the one-letter amino acid code in Table 8D. SignalP,
Psort and/or Hydropathy results predict that NOV8b contains a signal peptide and is likely
to be localized extracellularly with a certainty of 0.4467. The most likely cleavage site for a
NOV8b peptide is between amino acids 23 and 24, at: GPG-SE.



Table 8D. Encoded NOV8b protein sequence (SEQ ID NO:22).



MSALRPLLLLLLPLCPGPGPGPGSEAKVTRSCAETRQVLGARGYSLNLIPPALISGEHLRVCPQEYTCCSS
ETEQRLIRETEATFRGLVEDSGSFLVHTLAARHRKFDEFFLEMLSVAQHSLTQLFSHSYGRLYAQHALIFN
GLFSRLRDFYGESGEGLDDTLADFWAQLLERVFPLLHPQYSFPPDYLLCLSRLASSTDGSLQPFGDSPRRL
RLQITRTLVAARAFVQGLETGRNVVSEALKVPVSEGCSQALMRLIGCPLCRGVPSLMPCQGFCLNVVRGCL
SSRGLEPDWGNYLDGLLILADKLQGPFSFELTAESIGVKISEGLMYLQENSAKVSAQVFQECGPPDPVPAR
NRRAPPPREEAGRLWSMVTEEERPSADEDASGSGGGQQYADDWMAGAVAPPARPPRPPYPPRRDGSGGKGG
GGSARYNQGRSRSGGASIGFHTQTILILSLSALALLGPR

The NOV8b amino acid sequence has 322 of 380 amino acid residues (84%) identical
to, and 348 of 380 amino acid residues (91%) similar to, the Rattus norvegicus 579 amino acid
residue glypican-2 precursor (cerebroglycan) protein (ptnr:SWISSPROT-ACC:P51653) (E =
1.5e2»°).

NOV8b is expressed in at least the following tissues: Aorta, Brain, Cartilage, Cervix,
Liver, Lung, Oviduct/Uterine Tube/Fallopian tube, Parotid Salivary glands, Placenta, Prostate,
Retina, Skeletal Muscle, Stomach, Temporal Lobe, Testis, Vein. This information was derived
by determining the tissue sources of the sequences that were included in the invention
including but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or
RACE sources.

NOV8c

A disclosed NOV8c nucleic acid of 1613 nucleotides (also referred to as CG50970-03)
encoding a novel Glypican-2 Precursor-like protein is shown in Table 8E. An open reading
frame was identified beginning with an ATG initiation codon at nucleotides 1-3 and ending
with a TGA codon at nucleotides 1348-1350. A putative untranslated region downstream from
the termination codon is underlined in Table 8E, and the start and stop codons are in bold
letters.



Table 8E. NOV8c Nucleotide Sequence (SEQ ID NO:23)



ATGTCCGCGCTGCGACCTCTCCTGCTTCTGCT6CTGCCTCTG TGTCCC6GTCCTG6TCCC6GAC 

^S^S"^^°^'^^^°^^°^^-^^^^'^^^^^T<=T«5CAGAGACCCGGCAGGTGCTGGGGGCCCGGGG 

ATATAGCTTAAACCTAATCCCTCCCGCCCTGATCTCAGGTGAGCACCTCCGGGTCTGTCCCCAG 

GAGTACACCTGCTGTTCCAGTGAGACAGAGCAGAGGCTGATCAGGGAGACTGAGGCCACCTTCC 

GAGGCCTGGTGGAGGACAGCGGCTCCTTTCTGGTTCACACACTGGCTGCCAGGCACAGAAAATT 

TGATGAGTTTTTTCTGGAGATGCTCTCAGTAGCCCAGCACTCTCTGACCCAGCTCTTCTCCCAC 

TCCTACGGCCGCCTGTATGCCCAGCACGCCCTCATATTCAATGGCCTGTTCTCTCGGCTGCGAG- 

ACTTCTATGGGGAATCTGGTGAGGGGTTGGATGACACCCTGGCGGATTTCTGGGCACAGCTCCT 

GGAGAGAGTGTTCCCGCTGCTGCACCCACAGTACAGCTTCCCCCCTGACTACCTGCTCTGCCTC 

TCACGCTTGGCCTCATCTACCGATGGCTCTCTGCAGCCCTTTGGGGACTCACCCCGCCGCCTCC 

GCCTGCAGATAACCCGGACCCTGGTGGCTGCCCGAGCCTTTGTGCAGGGCCTGGAGACTGGAAG- 

:AAATGTGGTCAGCGAAGC6CTTAAGGTGCCGGTGTCTGAAGGCTGCAGCCAGGCTCTGATP!rr:T
CTCATCGGCTGTCCCCfG^^



TCTCCTGATCCTGGCTGATAAGCTCCAGGGCCCCTTTTCCTTTGAGCTGACGGCCGAGTCCATT 
GGGGTGAAGATCTCGGAGGGTTTGATGTACCTGCAGGAAAACAGTGCGAAGGTGTCCGCCCAGG 
TGTTTCAGGAGTGCGGCCCCCCeGACCCGGTGCCTGCCCGCAACCGTCGAGCCCCGCCGCCCCG' 
GGAAGAGGCGGGCCGGCTGTGGTCGATGGTGACCGAGGAGGAGCGGCCCACGACGGCCGCAGGC 
ACCAACCTGCACCGGCTGGTACTTGCCGCCAGTGGTCGGGGGCTCCCCGGCCGAGCA6GTCAAC 
AACCCCGAGCTCAAGGTGGACGCCTCGGGCCCCGATGTCCCGACACGGCGGCGTCGGCTACAGC 
TCCGGGCGGCCACGGCCAGAATGAAAACGGCGGCACTGGGACACGACCTGGACGGGCAGGACGC 
GGATG AGGATGCCAGCGGCTCTGGAGGGGGACAGCAGTATGCAGATGACTGGATGGCTGGGGCT 
GTGGCTCCCCCAGCCCGGCCTCCTCGGCCTCCATACCCTCCTAGAAGGGATGGTTCTGGGGGCA 



AAGGAGGAGGTGGCAGT6CCCGCTACAACCA66GCCGGAGCAGGAGTGGGGGGGCATCTATTGG 



TTTTCACACCCAAACCATCCTCATTCTCTCCCTCTCAGACCTGGCCCTGCTTGGACCTCGATAA 



CGGGGGAGGGGTG 
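
The correspondence between a nucleotide listing and its encoded protein can be illustrated by translating the first codons of the NOV8c reading frame with the standard genetic code. This is a minimal sketch rather than the method used to prepare the tables; the names `CODON_TABLE` and `translate` are hypothetical, and the 48-base fragment below assumes that each character printed as "6" in the scanned listing is a G.

```python
from itertools import product

# Standard genetic code, enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence from its first base until a stop codon."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # termination codon: stop translating
            break
        residues.append(aa)
    return "".join(residues)


# First 48 bases of the NOV8c open reading frame (nucleotides 1-48).
nov8c_5prime = "ATGTCCGCGCTGCGACCTCTCCTGCTTCTGCTGCTGCCTCTGTGTCCC"
print(translate(nov8c_5prime))
```

The translated fragment, MSALRPLLLLLLPLCP, agrees with the first sixteen residues of the NOV8c polypeptide in Table 8F.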



The disclosed NOV8c nucleic acid sequence, localized to chromosome 2, has 994 of
1172 bases (84%) identical to a Rattus norvegicus cerebroglycan mRNA (gb:GENBANK-
ID:RATCRBGLVC|acc:L20468.1) (E = 1.3e-^^).

A disclosed NOV8c polypeptide (SEQ ID NO:24) encoded by SEQ ID NO:23 is 449
amino acid residues and is presented using the one-letter amino acid code in Table 8F. SignalP,
Psort and/or Hydropathy results predict that NOV8c contains a signal peptide and is likely
to be localized extracellularly with a certainty of 0.3700. The most likely cleavage site for a
NOV8c peptide is between amino acids 23 and 24, at: GPG-SE.



Table 8F. Encoded NOV8c protein sequence (SEQ ID NO:24).



MSALRPLLLLLLPLCPGPGPGPGSEAKVTRSCAETRQVLGARGYSLNLIPPALISGEHLRVCPQEYTCCSS
ETEQRLIRETEATFRGLVEDSGSFLVHTLAARHRKFDEFFLEMLSVAQHSLTQLFSHSYGRLYAQHALIFN
GLFSRLRDFYGESGEGLDDTLADFWAQLLERVFPLLHPQYSFPPDYLLCLSRLASSTDGSLQPFGDSPRRL 

SSRGLEPDWGNYLDGLLILADKLQGPFSFELTAESIGVKISEGLMYLQENSAKVSAQVFQECGPPDPVPAR 



The NOV8c amino acid sequence has 334 of 391 amino acid residues (85%) identical
to, and 359 of 391 amino acid residues (91%) similar to, the Rattus norvegicus 579 amino acid
residue glypican-2 precursor (cerebroglycan) protein (ptnr:SWISSPROT-ACC:P51653) (E =
1.4e-^«3).

NOV8c is expressed in at least the following tissues: Aorta, Brain, Cartilage, Cervix,
Liver, Lung, Oviduct/Uterine Tube/Fallopian tube, Parotid Salivary glands, Placenta, Prostate,
Retina, Skeletal Muscle, Stomach, Temporal Lobe, Testis, Vein. Expression information was
derived from the tissue sources of the sequences that were included in the derivation of the
NOV8c sequence.

NOV8d

A disclosed NOV8d nucleic acid of 725 nucleotides (also referred to as CG50970-04)
encoding a novel Glypican-2 Precursor-like protein is shown in Table 8G. An open reading
frame was identified beginning with an ATG initiation codon at nucleotides 160-162 and
ending with a TAA codon at nucleotides 688-690. Putative untranslated regions upstream from
the initiation codon and downstream from the termination codon are underlined in Table 8G,
and the start and stop codons are in bold letters.



Table 8G. NOV8d Nucleotide Sequence (SEQ ID NO:25) 



CGCCTGGTCCAGCTATCGTGCTCGGrATTCAGTTTTCCGGAGCAGCGCTCTTTCTCTGGCCCGC 
GGAACGGTCCCGCG6CCGAGTACCGGATTCCCGAGTTTGGGA6GCTCTGCTTTCCTCCTTAGGA 
CCCACTTTGCCGTCCTGGGGTGGCTGCAGTTA TGTCCGCGCTGCGACCTCTCCTGCTTnTfir.Tfl 

CTGCCTCTGTGTCCCGGTCCTGGTCCCGGACCCGGGAGCGAGGCAAAGGTCACCCGGAGTTGTG 

CAGAGACCCGGCAGGTGCTGGGGGCCCGGGGATATAGCTTAAACCTAATCCCTCCCGCCCTGAT 

CTCAGGTGAGCACCTCCGGGTCTGTCCCCAGGAGTACACCTGCTGTTCCAGTGAGACAGAGCAG 

AGGCTGATCAGGGAGACTGAGGCCACCTTCCGAGGCCTGGTGGAGGACAGCGGCTCCTTTCTGG 

TTCACACACTGGCTGCCAGGCACAGAAAATTTGATGAGTTTTTTCTGGAGATGCTCTCAGTAGC 

CCGGCCTCCTCGGCCTCCATACCCTCCTAGAAGGGATGGTTCTGGGGGCAAAGGAGGAGGTGGC 

AGTGCCCGCTACAACCAGGGCCGGAGCAGGAGTGGGGGGGCATCTATTGGTTTTCACACCCAAA 

CCATCCTCATTCTCTCCCTCTCAGCCCTGGCCTTGCTTGGACCTCGATAACGGGGGAGGGGTGC 
CCTA6CATCAGAAGGGTTCAT "" 



The disclosed NOV8d nucleic acid sequence, localized to chromosome 2, has 448 of
545 bases (82%) identical to a Rattus norvegicus cerebroglycan mRNA (gb:GENBANK-
ID:RATCRBGLVC|acc:L20468.1) (E = 4.2e-^°').

A disclosed NOV8d polypeptide (SEQ ID NO:26) encoded by SEQ ID NO:25 is 176
amino acid residues and is presented using the one-letter amino acid code in Table 8H. SignalP,
Psort and/or Hydropathy results predict that NOV8d contains a signal peptide and is likely
to be localized extracellularly with a certainty of 0.4467. The most likely cleavage site for a
NOV8d peptide is between amino acids 23 and 24, at: GPG-SE.



Table 8H. Encoded NOV8d protein sequence (SEQ ID NO:26).



MSALRPLLLLLLPLCPGPGPGPGSEAKVTRSCAETRQVLGARGYSLNLIPPALISGEHLRVCPQEYTCCSS
ETEQRLIRETEATFRGLVEDSGSFLVHTLAARHRKFDEFFLEMLSVARPPRPPYPPRRDGSGGKGGGGSAR
YNQGRSRSGGASIGFHTQTILILSLSALALLGPR
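
All four NOV8 variants are described with the same predicted signal peptide cleavage site, between amino acids 23 and 24 at GPG-SE. The stated flanking residues can be read off the shared N-terminal sequence as sketched below. The helper `cleavage_context` is hypothetical and only extracts residues around a given position; it does not predict cleavage sites.

```python
def cleavage_context(protein: str, last_signal_residue: int, flank: int = 3) -> str:
    """Return the residues around a cleavage site in 'XXX-YY' form.

    last_signal_residue is the 1-based position of the final residue of the
    signal peptide; cleavage occurs immediately after it.
    """
    if not 0 < last_signal_residue < len(protein):
        raise ValueError("cleavage position lies outside the sequence")
    left = protein[max(0, last_signal_residue - flank):last_signal_residue]
    right = protein[last_signal_residue:last_signal_residue + 2]
    return f"{left}-{right}"


# First 30 residues shared by the NOV8 polypeptides.
nov8_n_terminus = "MSALRPLLLLLLPLCPGPGPGPGSEAKVTR"
print(cleavage_context(nov8_n_terminus, 23))
```

The output, GPG-SE, matches the site reported for NOV8a through NOV8d.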



The NOV8d amino acid sequence has 103 of 119 amino acid residues (86%) identical
to, and 114 of 119 amino acid residues (95%) similar to, the Rattus norvegicus 579 amino acid
residue glypican-2 precursor (cerebroglycan) protein (ptnr:SWISSPROT-ACC:P51653) (E =
2.6e-'^).

NOV8d is expressed in at least the following tissues: Aorta, Brain, Cartilage, Cervix,
Liver, Lung, Oviduct/Uterine Tube/Fallopian tube, Parotid Salivary glands, Placenta, Prostate,
Retina, Skeletal Muscle, Stomach, Temporal Lobe, Testis and Vein. Expression information
was derived from the tissue sources of the sequences that were included in the derivation of
the NOV8d sequence.

Possible SNPs found for NOV8d are listed in Table 8I.

Table 8I: SNPs

Consensus Position   Depth   Base Change   PAF
227                  19      T>C           0.105
482                  55      C>T           0.036
523                  55      A>G           0.036
548                  55      G>A           0.036
573                  53      G>A           0.038
684                  28      T>C           0.393
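
The PAF column can be related to the Depth column if PAF is read as the fraction of aligned reads carrying the variant base. That reading is an assumption, since the abbreviation is not defined in this section, and the variant read counts used below (2 and 11) are inferred from the tabulated values rather than stated in the document. Both function names are hypothetical.

```python
def allele_fraction(variant_reads: int, depth: int) -> float:
    """Fraction of reads at a position that carry the variant base."""
    if depth <= 0 or not 0 <= variant_reads <= depth:
        raise ValueError("variant_reads must lie between 0 and depth")
    return round(variant_reads / depth, 3)


def implied_variant_reads(paf: float, depth: int) -> int:
    """Nearest whole number of variant reads implied by a PAF value and a depth."""
    return round(paf * depth)


# (consensus position, depth, PAF) as listed in Table 8I
table_8i = [(227, 19, 0.105), (482, 55, 0.036), (523, 55, 0.036),
            (548, 55, 0.036), (573, 53, 0.038), (684, 28, 0.393)]

for position, depth, paf in table_8i:
    reads = implied_variant_reads(paf, depth)
    print(position, depth, reads, allele_fraction(reads, depth))
```

Under this assumption every row of Table 8I is reproduced exactly by a whole number of variant reads, two for the first five positions and eleven for position 684.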



alignment in Table 8J.



Table 8J. Alignment of NOV8a - NOV8d



NOV8A 
NOV8B 
NOV8C 
NOV8D 



NOVSA 
N0V8B 
NOV8C 
N0V8D 



NOVSA 
NOV8B 
NOV8C 
NOV8D 



NOVSA 
NOV8B 
NOV8C 
NOV8D 




50 



60 



1 



70 



80 

, I 



GZSLNIiIPPALIS.GEHIiRVCPQEYTCCSSETEQRLIRE 
ARGySI,NI.IPPALlSGEHI,RVCPQEyTCCSSETEQRLIRE 
ARGYSLNLXPPAIiISGEHLRVCPQEyTCCSSETEQRLIRE 
ARGYSI4NI.IPPALISGEHLRVCPQErTCCSSETEORr.TREi 



90 100 220 220 
I 1 .... ! I I ! 1 

^30 140 150 160 
I I 1 1 \ \ I I 



SLTQLFSHSYGRLYAQHALIPNGLFSRLRDFYGESGEGLD 
SLTQLFSHSYGRLYAQHALIFNGLFSRLRDPYGESGEGLD 

sltqi^hsygrlyaqhalifnglfsrlrdfygesgegldI 



[TE ATPRGIiVED S GSFLVHTIJ^ARHRKFDEFFLEMLSVAQH 
TEATFRGLVEDSGSPLVHTLAARHRBCFDEFFLEMLSVAQH 

teatprglvedsgsflvhtlaarhrkfdefflemlsvaqhI 

TEATPRGLVEDSGSPLVHTLAARHRKFDEFFLEMIiSV 



NOVSA 
NOVSB 
NOV8C 
NOVSD 



NOVSA 
NOVBB 
NOV8C 
NOVSD 



NOVSA 
NOVSB 
NOVSC 
NOVSD 



^70 180 190 200 
\ " > ■ I > . . . I I .... 1 1 .... I I 

210 220 230 . 240 

1. . . . 1 . . ; . 1 I I .... 1 I 

- ■ 250 - 260 270 280 
I I I ,\ I I 



TLZUDFWAQLLERVFPLLHPQYSFPPDYLLCLSRIiASSTD 
JTLADFWAQLLERVFPLLHPQYSFPPDYLLCLSRLASSTD 
)TL^EmQLLERVPPLLHPQYSFPPDYLLCLSRIASSTD 



GSLQPFGDSPRRLRLQITRTLVAARAFVQGLETGRNWSE 
GSLQPFGDSPRRLRLQITRTLVAARAFVQGLETGRNWSE 
GSLQPFGDSPRRLR LQXTRTLVaARAFVQGLETGRNWSE 
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N0V8A 
NOV8B 
NOV8C 
NOV8D 



NOV8A 
NOV8B 
N0V8C 
NOV8D 



NOV8A 
NOV8B 
NOV8C 
NOV8D 



NOfVSA 
NOV8B 
N0V8C 
KOV8D 



MOV6A 
MOV8B 
H0V8C 

Noveo 



NOV8A 
NOV8B 
NOV8C 
NOV8D 



NOV8A 
NOV8B 
NOV8C 
NC3VSD 



NOV8A 
NOV8B 
NOV8C 
N0V8D 



290 



( 



300 



310 

, . 1 :*. 



320 



RGCLSSRGIiEPDWGNYLDGIililliADKLQGPFSFELTAESI 
RGCLS SRGLE PDWGNYLD GLLIIiADKLQGPFSFEIiTAE S I 
RGCIiSSRGliEP DWGNYLD GIiLIIADKLQGPFSFELTAE S X 



330 



1 



340 
. . 1 . . 



350 



. 360 
1 .... I 



gvkxseglmyxiqensakvsaqvfqecgppdpv-parnrrap 
gvkisegimylqensakvsaqvfqecgppdpvparnrrap 
gvkisegl mylqensakvsaqvfqecgppdpvparnrrap: 



I 



370 

..I.... I 



pfreeagrlwsmvteee 

PPREEAGRIiWSMVTEEE 

ppreeagrlwsmvteeeI 



380 390 400 

RPTTAAGTmiHHIiVWEIiRERIiAR 
RPSADE 



410 420 430 440 

•••-I--.. I. ...I.. ..I. ...I.. .-I.... I. ...I 
MRGFRAIU:.SIiWC6DSPMAADASIiEAAPCV9T6A<Sl6RYIiP 



450 460 470 480 

....I.... I,... I.... )....!.... I.... I...., 

PW66SPAEQVNNPEIiKVDASGPDVPTRRRRI.Q1*RAAIAR 



^^0 500 510 520 

mktaalghdicgqdadedasgsgggqqyaddwmagavapp 
das6sgggqqxaddwmagavapp 




530 
I . . 



540 



550 



560 
■ . I 



-ggTTAAGTNLHRLVLAASGRGL 



570 



580 
I 



prrdgsggkggggsary 
prrd gsggkgg ggsary 

prrdgsggkggggsary) 



590 



. BNGiagGiCRPG : 




Homologies to either of the above NOV8 proteins will be shared by the other NOV8 
protein insofar as they are homologous to each other as shown above. Any reference to NOV8 
is assumed to refer to both of the NOV8 proteins in general, unless otherwise noted. 

The disclosed NOV8 polypeptide has homology to the amino acid sequences shown in 
the BLASTP data listed in Table 8K. 
Table 8K. BLAST results for NOV8a 

Gene Index/Identifier                          Protein/Organism                                                     Length (aa)  Identity (%)   Positives (%)  Expect 
gi|1708021|sp|P51653|GPC2_RAT                  glypican-2 precursor (cerebroglycan) (HSPG M13) [Rattus norvegicus]  579          476/581 (81%)  512/581 (87%)  0.0 
gi|7106325|ref|NP_035951.1|                    glypican 6 [Mus musculus]                                            555          226/512 (44%)  332/512 (64%)  1e-124 
gi|5031719|ref|NP_005699.1|                    glypican 6 precursor [Homo sapiens]                                  555          225/512 (43%)  330/512 (63%)  1e-122 
gi|6680059|ref|NP_032176.1|                    glypican 4 [Mus musculus]                                            557          208/487 (42%)  314/487 (63%)  1e-114 
gi|13879296|gb|AAH06622.1|AAH06622 (BC006622)  glypican 4 [Mus musculus]                                            557          208/487 (42%)  314/487 (63%)  1e-114 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 8L. 

Table 8L. ClustalW Analysis of NOV8a 

1) Novel NOV8a (SEQ ID NO:20) 

3) gi|7106325|ref|NP_035951.1| glypican 6 [Mus musculus] (SEQ ID NO:71) 
5) gi|6680059|ref|NP_032176.1| glypican 4 [Mus musculus] (SEQ ID 

6) gi|13879296|gb|AAH06622.1|AAH06622 (BC006622) glypican 4 [Mus musculus] (SEQ ID NO:74) 

NOV8A 

gi I 1708021 I 
gi I 71063251 
gi( 5031719 r 
gi I 66800591 
gi I 13879296 I 

NOV8A 

gi I 17080211 
gi I 71063251 
gi I 50317191 
gi I 6680059/ 
gi 113879296 I 

NOVSA . . , . . 

gi 1 1708021 1 
gi I 7106325 1 
gi I 5031719 1 
gi 166800591 
gi I 13879296} 

NOV8A 

gi 117080211 • 
gi J 71063251 
gi 15031719) 
gi] 6680059} 
gi 113879296 I 

NOV8A 

gi I 17080211 
gi J 71063251 
gi 1 5031719 1 
gi I 6680059 1 
gi 1 138792961 




241, AL 
■ 241 AL 
235 VS 
235 VS 
'235 VSvl 
•235'VSV 
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NOV8A 

gi 11708021 1 
gil 7106325 1 
gi 1 5031719 1 
gil 6680059 I 
gil 138792961 

N0V8A 

gil 1708021 I 
gil 7106325 1 
gil 5031719 1 
gil 66800591 
giU3879296| 

N0V8A 

gil 1708021 I 
gil 7106325 1 
gil 5031719 I 
gi I 6680059) 
gil 13879296 I 

KOV8A 

gi|1708021| 
gil 7106325 1 
gil 5031719 I 
gi I 6680059 1 
gil 13879296 I 

NOV8A 

gil 1708021 I 
gil 7106325 1 
gil 5031719 1 
gil 6680059 1 
gil 13879296 I 

NOV8A 

gil 1708021 I 
gil 7106325 1 
gil 5031719 1 
gil 6680059 1 
gil 13879296 I 




svmd'p 

SVMDP 

SsvmdpI 
vmd; 



360 ppBreeagrl^jsmvte 

360 PAjREETSRSWRSSAEa 

355 SA jEN-FNTKElRPYNPo 

355 SAgEN-FNTRPRPYNp'S 

355 SISESAFSARPRPYHpSq 

355 SISESAFSAKFRPYHPgQ 

420 gDASL 
420 gDLSi 
414 gGTSN 

414 gGTSM 

415 gGNEN 
415 BgH£] 




Vi^RNRRA 359 
|h|VQSRNRRA 359 
^""^'"IliRSAR 354 
RSAR 354 
RISR 354 
IGRISR 354 




LppJvvggsp; 
■sp.vwgsln; 
peimndgl' 

iPETtMNDGLTNl 

fa^'tgng; 

I'FAVTGNl 




480 RMKTAALGHDLDGQDAitjEDASG^GGGQQyADX: 
.478 RMKAAALGQDLDMHDifiBDASGSSGGGQQYADDn 

446 PgVEVDljTRPDTFIRQQlB 

44 6 PijvDVDIfTRPDTFIRQQIg 

447 P^VQVDlteKPDlLILRQll 

447 — ^ PBVQVDlgKPDlIiILRQlB 



•Sryi^qgrs 

HRraQGRS 

gs gcmddvcptefefvtteapavs^p-d: 

GSGCMDDVCPTEFEFVTTEAPAVpP-D: 




GSGCBYQQCPSEFEYNATDHfiGKpANEKADSAGGAHAE^^F 
E^GSGCEYQQCPSEFEYNATDHkGKgANEKADSAGGAHAEAKP 



419 

SrWa 419 
tlviT 413 
ESivlT 413 
ERblA 414 
ERMA 414 



iPELKVDASGPDVPTRRRRLQLRAATA 479 
— PELDTSSPDVPTRRRRLHLRAATA 477 

446 

446 

447 

447 

— GAVAPPARPPRPPYPPRaRfflGgGG 537 
~ APWPPARPPRPPRPPRfefflGLGV 537 
CiRVMTNKLKNAYNGNDVNEfeHrHDE 491 
[^RVMTKKLKKAYNGNDVNfI^fSdE 491 
LRVMTSKMRNAYNGNDVDF^IgDB 492 
RVMTSKMKNAYNGNDVDFFglgDE 492 

IGFH' 

vglhap: 

KFSSSj 
kQRGHSj 





- 579 
— 579 
YR 555 
iCR 555 
553 QGEWR 557 
553 C2GEWR 557 



Table 8M lists the domain description from DOMAIN analysis results against NOV8a. 
This indicates that the NOV8a sequence has properties similar to those of other proteins 
known to contain these domains. 



Table 8M. Domain Analysis of NOV8a 

gnl|Pfam|pfam01153, Glypican. (SEQ ID NO:98) 
Length = 554 residues, 86.1% aligned 

Score = 536 bits (1380), Expect = 2e-153 



N0V8a 


24 


01153 


\ll 


NOVSa 


B4 


01153 


77 


N0V8a 


144 


01153^ 


137 


NOVSa 


204 


01153 . 


193 



SEAKVTRSCAETRQVLGARGYSLNLIEPALISGEHLRVCPQEyTCCSSiSTEQRLIRETEA 83 
+1MII 11+ ll + l + III +1 + n||)I++l|i Mllli) + 

AEGSKSRSCAEVRQLFGAKGFSLNDVPQSEISGEHLQICPQGYTCCSSEMEEKLQLKARG 7 6 

TFRGLVEDSGSFLVHTLAARHRKFDEFFLEMLSVAQHSLTQLFSHSYGRLYAQHALIFNG 143 

' '++1111 II +1 1 I I 1 + I M I +11 I I I l + l +1 

DFEQLLQDSSSSLQFLLATNAKKFQEHFEELLNISENYLNALFSKTYGRLYPQNAEMFKD 13 6 

LFSRLRDFYGESGEGLDDTIADFWAQLLERVFPLLHPQYSFPPDYLLCLSRLASSTDGSL 203 
^^'^ I I++ I +111 + 1111 ] 1) IJ I IIJ II + I 

LFTELRLYYRGSNINLEEALNEFWARLLERAFKQLHGQYDSPDDYLECLRKARE DL 192 

^fT??^^^^^^^^^'^^^^^^^^^^G^ETGRNVVSEAIJCVPVSEGCS 263 

" -^i^M .|iN ii + M ))}ti Kill I ni+ +11 + 1+ H + I1+++I II , 

KPFGDIPRRLMLQVTRALVAARTFLQGLNVGIEWSKVDQVPLSKECSRALLKMIYCPHC ' 252 
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0XXS3 -3-i^.WvkiiviJiiii;^iii^^^^ 
■NO^ea-. 323 KISEGL^nfLQENSAKVS^^^ 

01153 .313 KISEAIMALQB:NGVKLTAKVFQGCG----TkpTPYGSASGplDKisKRpiKpiE^^^^^ 368 
•NOVea • 383 AGTNl^LWELRERI^GFWARLSL^^^^ .'^ 
01153 , • 3eS T---^iiivi^^iKjJ^3iis%ipGi.^i-ii^^^^^^^^ ' ,,3 

NOVSa 503 GGGQQYADDW 512 

I ) ii 
01153 484 GSGDGCGDDD 493 
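
The "% aligned" figure in the domain tables can be reproduced from the domain coordinates of an alignment. The sketch below assumes that the figure is the fraction of the domain model covered between the first and last aligned domain positions, rounded to one decimal. For the Glypican domain, the end coordinate 493 and the model length 554 come from Table 8M; the start coordinate 17 is inferred (the first domain row shows 60 ungapped residues ending at position 76) because the printed start is not legible. The function name `percent_aligned` is hypothetical.

```python
def percent_aligned(domain_start: int, domain_end: int, domain_length: int) -> float:
    """Percentage of a domain model spanned by an alignment.

    Assumption: coverage is counted from the first to the last aligned
    domain position, inclusive, and rounded to one decimal place.
    """
    if not 1 <= domain_start <= domain_end <= domain_length:
        raise ValueError("coordinates must satisfy 1 <= start <= end <= length")
    covered = domain_end - domain_start + 1
    return round(100.0 * covered / domain_length, 1)


# pfam01153 (Glypican): positions 17-493 of a 554-residue model.
print(percent_aligned(17, 493, 554))
# A 23-residue model aligned over its full length, as for zf-C2H2.
print(percent_aligned(1, 23, 23))
```

With the inferred start of 17, the first call reproduces the 86.1% reported for the Glypican domain; a model aligned end to end gives 100.0%.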

Glypicans are a family of heparan sulfate proteoglycans which are anchored to cell 
membranes by a glycosylphosphatidylinositol (GPI) linkage. Structurally, these proteins 
consist of three separate domains: a signal sequence, an extracellular domain of about 500 
residues that contains 12 conserved cysteines probably involved in disulfide bonds and which 
also contains the sites of attachment of the heparan sulfate glycosaminoglycan side chains, and 
a C-terminal hydrophobic region which is post-translationally removed after formation of the 
GPI-anchor. 

The above defined information for NOV8 suggests that NOV8 may function as a 
member of a Glypican-2 Precursor family. Therefore, the NOV8 nucleic acids and proteins of 
the invention are useful in potential therapeutic applications implicated in various diseases and 
disorders described below and/or other pathologies. For example, the NOV8 compositions of 
the present invention will have efficacy for treatment of patients suffering from diabetes, 
diabetes mellitus non-insulin dependent, autoimmune disease, renal artery stenosis, interstitial 
nephritis, glomerulonephritis, polycystic kidney disease, systemic lupus erythematosus, renal 
tubular acidosis, IgA nephropathy, hypercalcemia, Lesch-Nyhan syndrome, hemophilia, 
hypercoagulation, idiopathic thrombocytopenic purpura, immunodeficiencies, graft versus 
host disease, Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous 
sclerosis, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, multiple 
sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, anxiety, 
pain, neurodegeneration, cancer, developmental abnormalities, Acyl-CoA dehydrogenase 
deficiency of long chain, Brachydactyly, type A1, Carbamoylphosphate synthetase I 
deficiency, Cardiomyopathy dilated II, Cataract Coppock-like, Cataract crystalline 
aculeiform, Cataract polymorphic congenital, Cataract variable zonular pulverulent, Cataract 
punctate progressive juvenile-onset, Choreoathetosis familial paroxysmal, Craniofacial- 
deafness-hand syndrome, Ichthyosis lamellar, type 2, Myopathy, desmin-related 
cardioskeletal, Resistance/susceptibility to TB, Rhabdomyosarcoma alveolar, Waardenburg 
syndrome type I and type III, Alport syndrome autosomal recessive, Bjornstad syndrome, 
Hematuria, familial benign, Hyperoxaluria primary, type 1, Syndactyly type 1, 
Hyperproglucagonemia, Bethlem myopathy, Brachydactyly type E, Brachydactyly-mental 
retardation syndrome, Finnish lethal neonatal metabolic syndrome, susceptibility to 2, 
Simpson-Golabi-Behmel syndrome, type 1 and type 2 and Beckwith-Wiedemann syndrome. 
The NOV8 nucleic acid encoding Glypican-2 Precursor-like protein, and the Glypican-2 
Precursor-like protein of the invention, or fragments thereof, may further be useful in 
diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed. 

NOV9 

A disclosed NOV9 nucleic acid of 985 nucleotides (also referred to as 
AC011005_da2/139943578) encoding a novel Mitogen Activated Protein Kinase Kinase 2-like 
protein is shown in Table 9A. An open reading frame was identified beginning with an 
ATG initiation codon at nucleotides 54-56 and ending with a TGA codon at nucleotides 975-977. 
The start and stop codons are in bold letters. 
Table 9A. NOV9 Nucleotide Sequence (SEQ ID NO:27) 



^S^^^^^^^'^ct^cccagaaagccaaggtcggcgaactcaaagacgatgacttcSSat^^ 

CACy^TGGACGGCGGCTCCCTGGACCAGGTGCTGaaAGAGGCCflAGAGGATTCCCGAGGSlTC?^^ 
^°^5^S=*^CeC'=S''TCTCCGGGGCTTGGCGTACCTCCGAGAGAAGC^^^ 

^^S^S^^^^'^^^^^^^^'^^'^S'SGCaCGCGCTCCTACaiTGGCTCCACCTCCTAAGCTG^^ 

tgtcttcacccccgacttccaggagtttgtcaataaatgcctcatcaagaacccagcggS 

TGAAGATGCTCACAAACCACACCTTaiTCAAGCGGTrr«ar.r.a.^r.»»«,.^r:^™„S^^ 



The disclosed NOV9 nucleic acid sequence has 754 of 759 bases (99%) identical to a 
Homo sapiens ERK activator kinase (MEK2) mRNA (gb:GENBANK-ID:HUMMEK2NF|acc:L11285) 
(E = 1 .Se^"). The NOV9 nucleic acid sequence contains 
numerous SNPs which result in various amino acid changes. 

A disclosed NOV9 polypeptide (SEQ ID NO:28) encoded by SEQ ID NO:27 is 307 
amino acid residues and is presented using the one-letter amino acid code in Table 9B. Signal 
P, Psort and/or Hydropathy results predict that NOV9 does not contain a signal peptide and is 
likely to be localized in the cytoplasm with a certainty of 0.5500. 
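
The stated polypeptide length follows directly from the open reading frame coordinates given for the nucleic acid. The sketch below is a minimal check, assuming 1-based inclusive nucleotide coordinates and that the terminal stop codon is not translated; the function name `orf_protein_length` is hypothetical. The coordinates are those quoted in this document for NOV9 (54 to 977) and NOV10 (385 to 1506).

```python
def orf_protein_length(start_codon_first_nt: int, stop_codon_last_nt: int) -> int:
    """Number of amino acids encoded by an ORF.

    Coordinates are 1-based and inclusive: the first nucleotide of the
    ATG start codon and the last nucleotide of the stop codon.
    """
    span = stop_codon_last_nt - start_codon_first_nt + 1
    if span < 6 or span % 3 != 0:
        raise ValueError("ORF span must be a whole number of codons")
    # The stop codon occupies one codon but encodes no residue.
    return span // 3 - 1


print(orf_protein_length(54, 977))    # NOV9 ORF
print(orf_protein_length(385, 1506))  # NOV10 ORF
```

The NOV9 coordinates span 924 nucleotides, or 308 codons including the stop, which gives the 307 residues stated above; the NOV10 coordinates give 373 residues in the same way.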
Table 9B. Encoded NOV9 protein sequence (SEQ ID NO:28). 



— w T ^ UI.61U sequence [piijij uj ^ 

DFAGWLCKTLRLNQPGTPTRTAV 




The NOV9 amino acid sequence has 236 of 236 amino acid residues (100%) identical 
to, and 236 of 236 amino acid residues (100%) similar to, the Homo sapiens 400 amino acid 
residue mitogen-activated protein kinase kinase 2 (EC 2.7.1.-) (Map kinase kinase 2) 
(MAPKK 2) (ERK activator kinase 2) (ptnr:SWISSPROT-ACC:P36507) (E = 8.2e-'"). 

NOV9 is expressed in at least the following tissues: Adrenal Gland/Suprarenal gland, 
Amygdala, Bone, Bone Marrow, Brain, Colon, Coronary Artery, Dermis, Epidermis, Foreskin, 
Heart, Hypothalamus, Kidney, Liver, Lung, Lymph node, Lymphoid tissue, Mammary 
gland/Breast, Muscle, Nervous, Ovary, Pancreas, Peripheral Blood, Pituitary Gland, Placenta, 
Prostate, Retina, Small Intestine, Spleen, Stomach, Testis, Thymus, Tongue, Tonsils, Tumor, 
Umbilical Vein, Uterus, Whole Organism. This information was derived by determining the 
tissue sources of the sequences that were included in the invention. In addition, NOV9 is 
predicted to be expressed in the following tissues because of the expression pattern of a 
closely related Homo sapiens ERK activator kinase (MEK2) mRNA homolog (GENBANK-ID: 
gb:GENBANK-ID:HUMMEK2NF|acc:L11285): Lymphoid tissue, Nervous tissue, 
Gastrointestinal tissue, Peripheral Blood, and Cardiovascular tissue. 

NOV9 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 9C. 





Table 9C. BLAST results for NOV9 

Gene Index/Identifier                          Protein/Organism                                                                                                                                    Length (aa)  Identity (%)    Positives (%)   Expect 
gi|13651323|ref|XP_016871.1|                   similar to mitogen-activated protein kinase kinase 2; protein kinase, mitogen-activated, kinase 2, p45 (MAP kinase kinase 2) [Homo sapiens]        325          236/236 (100%)  236/236 (100%)  1e-133 
gi|13489054|ref|NP_109587.1|                   mitogen-activated protein kinase kinase 2; protein kinase, mitogen-activated, kinase 2, p45 (MAP kinase kinase 2) [Homo sapiens]                   400          236/236 (100%)  236/236 (100%)  1e-131 
gi|1096928|prf||2113192A                       MEK2 protein [Rattus norvegicus]                                                                                                                    400          229/236 (97%)   235/236 (99%)   1e-129 
gi|12844163|dbj|BAB26261.1| (AK005392)         putative [Mus musculus]                                                                                                                             401          229/236 (97%)   235/236 (99%)   1e-129 
gi|15990388|gb|AAH14830.1|AAH14830 (BC014830)  Unknown (protein for MGC:25475) [Mus musculus]                                                                                                      401          229/236 (97%)   235/236 (99%)   1e-129 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 9D. 

Table 9D. Information for the ClustalW proteins 

1) NOV9 (SEQ ID NO:28) 

2) gi|13651323|ref|XP_016871.1| similar to mitogen-activated protein kinase kinase 2; protein kinase, mitogen- 
activated, kinase 2, p45 (MAP kinase kinase 2) [Homo sapiens] (SEQ ID NO:75) 

3) gi|13489054|ref|NP_109587.1| mitogen-activated protein kinase kinase 2; protein kinase, mitogen-activated, 
kinase 2, p45 (MAP kinase kinase 2) [Homo sapiens] (SEQ ID NO:76) 

4) gi|1096928|prf||2113192A MEK2 protein [Rattus norvegicus] (SEQ ID NO:77) 

5) gi|12844163|dbj|BAB26261.1| (AK009392) putative [Mus musculus] (SEQ ID NO:78) 

6) gi|15990388|gb|AAH14830.1|AAH14830 (BC014830) Unknown (protein for MGC:25475) [Mus musculus] 
(SEQ ID NO:79) 



NOV9 

gi 113651323 I 
gi 1 13489054 1 
gl 110969281 
g± 112844163 1 
g± 1 15990388 1 

NOV9 

gi 1 13651323 I 
gi 113489054) 
gi 1 1096928 I 
gi 1 12844163 1 
gi 1 15990388 1 

NOV9 

gi 1 13651323 I 
gi 1 13489054 I 
gi 110969281 
git 128441631 
gi I 15990388} 

NOV9 

gi 113651323 1 
Gfi 113489054 I 
gi 11096928 1 
gi I 128441631 
gi| 15990388) 

NOV9 

gi 113651323) 
gi 1134890541 
gi I 10969281 
gi 112844163) 
gl I 15990388) 

NOV9 

gi 1 136513231 
gi 1134890541 
gi I 10969281 
gi 1128441631 
gi 1159903881 



1 
1 
1 
1 
1 
1 

61 

61 
61 
61 
61 
61 

121 
121 
121 
121 
121 
121 

181 
181 
181 
181 
181 
181 

235 
241 
241 
241 
241 
241 



|LMLARRKPVLPALTINPTIAEGPSPTSEGASEANLVDLQKKLEEL^LDEQQraKRLEAFLTQ 
MLARRKPVLPALTINPTXAEGPSPTSEGASEAl^LVDLQKKLEEL||LDEQQiSKRLEAFLTQ 

mlarrkpvlpaltinptiaegpsptsegaseanlvdlqkkleel|ldeqq!5krleafltq 
mlarrkpvlpaltinptiaegpsptsegaseaivilvdlqkkleelEldeqqSkrleafltq' 

MLARRKPVLPALTINPTIAEGPSPTSEGASEANLVDLQKKLEELgLDEQQjSKRLEAFLTo! 

mlarrkpvlpaltinptiaegpsptsegaseahlvdlokkleelISldeqqSkrleafltoI 



KAKVGELKDDDFERISELGAGNGGVVTKfflfflHRPSGLIMARKLIHLEIKPASRNQIIRELQ 

kakvgelkdddferiselgagnggvvtb^hrpsglimarklihleikpaHrnqiirelq 
kakvgelkdddferiselgagnggwtk^hrpsglimarklihleikpaIIrkqiirelq' 

KAKVGELKDDDFERISELGAGNGGWTKgaHRPSGLIMARKLIHLEIKPABRWQIIRELol 
KAKVGELKDDDFERISELGAGNGGWTKRaHRPSGLIMARKLIHLEIKPABRNQIIRELQ 
KAKVGELKDDDFERISELGAGMGGVVTKSgHRPSGLIMARKLIHLEIKPASRNOIIRELol 



VLHECNSPYIVGFYGAFYSDGEISICMEHMDGGSLDQVLKEAKRIPEraiLGKV^AVLRG 

vlhecnspyivgfygafysdgeisicmehmdggsldqvlkeakripeISilgkvsiavlrg 
ivlhecnspyivgfygafysdgeisicmehmdggsldqvlkeakripeIilgkvsiavlrgI 
vlhecnspyivgfygafysdgeisicmehmdggsldqvlkeakrxpeSilgkvsiavlrg 

VLHECNSPYIVGFYGAFYSDGEISIC^5EHMDGGSLDQVLKEAKRrPESILGKVSIAVLRG 
VLHECHSPYIVGFYGAFYSDGEISICMEHMDGG5LDQVLKEAI<aiIPEglLGI<VSIAVLRGl 



laylrei^hqimhrdvkpsnilvnsrgeiklcdfgvsgqlidsmansfv/gtrsym 

L-AYLREKHQIMHRDVKPSNILVHSRGEIKLCDFGVSGQLIDSMANSFVGTRSYMfiPERLO 

laylrekhqimhrdvkpsnilvnsrgeiklcdfgvsgqlidsmawsfvgtrsymSperlo 
laylreichqimhrdvkpsnilvnsrgeiklcdfgvsgqlidsmansfvgtrsymSperlo! 

LAYLREKHQIMHRDVKPSNILWSRGEIKLCDFGVSGQLIDSMAlxiSFVGTRSyM S PERLO 
LAYLREKHQI MHRDVKPSNI LVNSRGEIKLOr)FnVSf^nT.Tn.QMaMc;p-,r^Tnc.vf.^ « perlqI 







gthysvqsdxwsmglslvelaSgrypip^^ 
gthysvqsdiwsmglslvel^JJgrypipppdakele 

GTHYSVQSDIWSMGLSXi^iAHGP.YEIPPPDAKELE 

gthysvqsdiwsmglslvelaBgrypipppdakele 






FGRPVVDGggGEPHSgSPRPRp 
FGRP V VDGjSgE PH sBs PRPRP 
FGRPWDG^GEPHSfflsPRPRP 
FGRPVVDgSSgEPHsSs PRPRP 



60 

60 

60 

60 • 

60 

60 

120 

120 ' 

120 

120 

120' 

120 

180 
180 
180 
180 
180 
180- 

235 
240 
240 
240 
240 
240 

235 
254 
300 
300 
300 
300 




PPPKLRBG 
GHGtyiDSRPAMAIFELLDYIVNEPPPKLMG 

ghgmdsrpamaifelldyitoepppklpBg 
ghgmdsrpamaifell'dyivnepppklpSg 
ghgmdsrpamaifelldyivnepppklpIgI 




NOV9 2 64 [S52Sm]3t|22t| 

gi 1 13651323 | 285 *DGBEGEPHiS,'I's!pRpBppBR 
gi 113489054) 357 
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gi 1 1096928 I 357 
gi 1 12844163 I 358 
gil 159903881 357 



FIKRSEGEEVDFAGWLCgTLRLgQPSTPTRTAV 
FIKRSEGEEVDFAGWLCgTLRLgQPSTPTRTAV 
FIKRSEGEEVDFAGWLC StLRlBqPSTPTRTAV 



400 
401 
400 



Tables 9E and 9F list the domain description from DOMAIN analysis results against 
NOV9. This indicates that the NOV9 sequence has properties similar to those of other 
proteins known to contain these domains. 



Table 9E. Domain Analysis of NOV9 

gnl|Smart|smart00220, S_TKc, Serine/Threonine protein kinases, 
catalytic domain; Phosphotransferases. Serine or threonine-specific 
kinase subfamily. (SEQ ID NO:99) 

Length = 256 residues, 100.0% aligned 
Score = 184 bits (468), Expect = 5e-48 



N0V9 

00220 

NOV9 

00220 

N0V9 

00220 

N0V9 

00220 

NOV9 

00220 



72 FERISELGAGNGGVVTKVQHRPSGLIM^ 

■ ' ' ' ' ^ ' + + +1 ++) !+l I +) 1 +1+1 I+++I + + 1 I . 

1 YELLEVLGKGAFGKVYIARDKKTGKLVAIKVIKKEKLKKKKR^ , 60 ^ 

131 VGFYGAETSDGEfsiCMEHMDGGSLDQVlJCEAKRIPEEIL^ 190 

^ i i I -^-^ 11+ +11 I +11+ 1+ 1+ + +1 ( ,1 I 

61 VKLYDVFEDDDKLYLVMEYCEGGDLFDLLKKRGRLSEDEARFYARQI^ 119 

191 MHRDVKPSNILVNSRGEXK^^ 

+111+11 III++I 1 +11 II + +11,1 ,jn -^-^^ 

120 IHRDLKPENILLDSDGHVKIJ^reLAKQLDSGGTLLTTFVGTPEYMAPEVL^^ 179 

237 

-PPKLPNGVFTPDFQEFVNKCLIK 259 

IBO DIWSLGVILYELLTGKPPFPGDDQLIJ^FKKlGKPPpiFPPPEWKIS^^ 239 

260 NPAERADLK^4LTNHTF1 276 

+1 +1 + I i 
240 DPEKRLTAEETU^HPFF 256 



Table 9F. Domain Analysis of NOV9 

Length = 256 residues, 100.0% aligned 
Score = 165 bits (418), Expect = 3e-42 



NOV9 

00069 

NOV9 

00069 

NOV9 - 

00069' 

N0V9 

00069 

N0V9 

00069 



72 



FERISELGAGNGG^^ 131 
^1 T| i-t-i III +1+ +(( + ) ++. .- . '-+ + +11 + J + I u-iiii 
1 YELGEKLGSGAFGKVnfKGKHKDTGEIVAIKII^LSEKKKRElii^ / go 

132 '^^SAEYSIK^EISiqyiEHMDGG^^^^ 

' ' ' + + II+I + I! I + +1+ 1+++ +11 M M J. I 

61 RLLGVFEEDDHLYLVMEYMEGGDLFDyLRI«,GIiLSEKEAKilALQj^^^^ 119 

X91 MHRDVKPSNILVNSRGEIKLC^ 

+1 1 l+l I . 1 1 I++ I +)+ Mil 
■120 VHRDI*PENlLLDENGT7KIADFGIJU«a^SSSyEKLTTFVGTI.E4^ I79 

237 • 

" • . " PPKLPNGVFTPDFQEFVNKCLik 259 

180 VD^mSLGVlLYELLTGKLPFPGIDPLEELFRlKERPRI^LPLPPNCS^^^ 239 

260 NPAERADLKMLTNHTFI 276 

+ 1 +1 •. I + 1 I + ' . 

240 DPEKRPTAKEILNHPWF 256 



The amino acid sequence of NOV9 has high homology to other proteins as shown in 
Table 9G. 

Table 9G. BLASTX results for NOV9 

                                                                                   Smallest 
                                                                                   Sum 
                                                               Reading  High       Prob 
Sequences producing High-scoring Segment Pairs:                Frame    Score      P(N)       N 
patp:AAY41652 Human MEK2 protein sequenc, Homo Sapi 400 aa..   +3       1194       4.8e-160   1 
patp:AAW88434 Dis ass prot kinase DAPK-3, Homo Sapi 400 aa..   +3       1186       3.3e-159   1 



suggest that NOV9 may have important structural and/or physiological functions characteristic 
of the Mitogen Activated Protein Kinase Kinase 2 protein family. Therefore, the NOV9 
nucleic acids and proteins of the invention are useful in potential therapeutic applications 
implicated in various diseases and disorders described below and/or other pathologies. For 
example, the NOV9 compositions of the present invention will have efficacy for treatment of 
patients suffering from atherosclerosis, metabolic diseases, pathogen infections and 
neurological diseases. The NOV9 nucleic acid encoding Mitogen Activated Protein Kinase 
Kinase 2-like protein, and the Mitogen Activated Protein Kinase Kinase 2-like protein of the 
invention, or fragments thereof, may further be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. 

NOV10 

A disclosed NOV10 nucleic acid of 1506 nucleotides (also referred to as 
sggc_draft_c333el_20000804_da2) encoding a zinc finger protein 276 C2H2 type-like protein 
is shown in Table 10A. An open reading frame was identified beginning with an ATG 
initiation codon at nucleotides 385-387 and ending with a TGA codon at nucleotides 1504-1506. 
A putative untranslated region upstream from the initiation codon is underlined in Table 
10A, and the start and stop codons are in bold letters. 



Table 10A. NOV10 Nucleotide Sequence (SEQ ID NO:29) 



TZr... o^^^i '^v^/itoftL,wr c ;cCGTACGACGAGCACTGTGAACCTCCG C CT6CTTGTCOP!nfypr 

^'^!^.':"^^^^'''^^^^''''^^^^^^°^^ '^<^^^<^^^g^^^^gCACAGGGGCAGAG6AGGGScCT^^ 
TGA6TCCTCCCCCGGTGGA66GTGG GCTGGGTGCCGACCA6CCGTGGATCTGACATrTr T^TT^n 7i?i?g 

CAGCTGCGGGGCCCTACCCCACCTTCAGA6GACACTGTrrTrnr!2iriTi^nn.cn,=/^r>na...nm^^,^^»- 



bbATCTGATCACATCCA6CCCCCAGT G CCTGCACGGC;TT6GTGGGGT< 3SGTGr .ATnP;AnaTr;nrrn.n 

^^^^^"^^^^^^^ggGCCACGACTACACC^TGGATACCAGCTCCAGCTGC^GGCCTTCTTGCTGScAGT 
S^^^^^<^<=^°^^^'2TGGCCATGGGACAAAGAGACGGCGCCACGGCTGCCCCAGCACCGAGGGTGGAACCC 
^^^^^^^'''''''''''''^'^^^^^^^^^^'^^'^'^'^'^^GGGACCCCAGTTGGGGCTGAGACCAA^ 

S^^S^^°^^^'^^^®°^°®^^^^^<^°'=^*^'^''<^'5<5TGAGAAGCAGCTTCCATCTTCAACCTCGGATGA 
!™^^^*^^^^*^^^^'^^®^^^^^^^^°^°°'=AGACGTCTTGAGTGAAGATGAAAATGACAAGAAGC 

G^^^SS^^*^''^^'^'^'^'''''^*''^^^^°<^^S^'5CCGGGACCCAAGCCCGGATGGAAGAA 

rAr^?n^^So^^^^^*^°^°^°^^^^^^^^'^*^^CCACGAGGAGGTCCGGGAGCGGCCCTGCCCC 
£^SSE!?<^='^'^=^=^'52TTTTCATGATCGACCGCTACCTGCAGCGCCACGTGAAGCTCATCCACACAr,a 



TGrrArf^^™J^^'^^°*^^^^'^^'^°*^^^^TTCAAGCAGCGGAAGCACCTTCTCGTCCACC^ 
AAr?A^Io^S°°^'^^^^'^^^^'^^^^^°^°^'^'^^°TCTGTGG6TTCCAGTGCAGGCAGCGGGCATCC^ 
^ArAA™^^^^^^^^°°^'^'^*'^*^^°^'^^TG6ACTTTGCCTGTGACCAGTGTGGCCGGC6GTT 

TGCCCCTGGAGGCGGAACCACC ACCTGGGCCACCGAGCCCCTCTGT6ACCACA6AGGGCCA6GCGGT6AAG 
The disclosed NOV10 nucleic acid sequence, localized to chromosome 16, has 271 of 
271 bases (100%) identical to a Homo sapiens Fanconi anaemia group A gene, exons 39, 40, 41, 
42 and 43 mRNA (gb:GENBANK-ID:HSZ83095|acc:Z83095) (E = 9.4e-^^). 

A disclosed NOV10 polypeptide (SEQ ID NO:30) encoded by SEQ ID NO:29 is 373 
amino acid residues and is presented using the one-letter amino acid code in Table 10B. 
Signal P, Psort and/or Hydropathy results predict that NOV10 does not contain a signal 
peptide and is likely to be localized at the mitochondrial matrix space with a certainty of 
0.5517. 



Table 10B. Encoded NOV10 protein sequence (SEQ ID NO:30). 




KKHlKEHHEEVRERPCPHPGCNKVFMIDRYLQRHVKLIHTEVWlYICDECGm^O^ 



The NOV10 amino acid sequence has 310 of 373 amino acid residues (83%) identical 
to, and 325 of 373 amino acid residues (87%) similar to, the Mus musculus 372 amino acid 
residue zinc finger protein 276 C2H2 type (ptnr:TREMBLNEW-ACC:AAG01634) (E = e.Se 

NOV10 is expressed in at least the following tissues: bone marrow, brain, cervix, 
colon, coronary artery, heart, hypothalamus, kidney, lymph node, lung, ovary, peripheral 
blood, prostate, testis, thyroid, tonsils, uterus and whole organism. 

The disclosed NOV10 polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 10C. 





Table 10C. BLAST results for NOV10 

Gene Index/Identifier                          Protein/Organism                                       Length (aa)  Identity (%)    Positives (%)   Expect 
gi|10048420|ref|NP_065243.1|                   zinc finger protein (C2H2 type) 276 [Mus musculus]     372          310/374 (82%)   325/374 (86%)   1e-155 
gi|11611571|dbj|BAB19000.1| (AB052145)         hypothetical protein [Macaca fascicularis]             298          251/253 (99%)   252/253 (99%)   1e-121 
gi|14776742|ref|XP_047520.1|                   hypothetical protein XP_047520 [Homo sapiens]          400          253/253 (100%)  253/253 (100%)  1e-120 
gi|11611570|dbj|BAB18999.1| (AB052145)         hypothetical protein [Macaca fascicularis]             280          104/110 (94%)   106/110 (95%)   8e-53 
gi|15559662|gb|AAH14187.1|AAH14187 (BC014187)  Unknown (protein for MGC:20975) [Homo sapiens]         615          86/226 (38%)    127/226 (56%)   7e-38 
The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 10D. 



Table 10D. ClustalW Analysis of NOV10 

1) Novel NOV10 (SEQ ID NO:30) 

2) gi|10048420|ref|NP_065243.1| zinc finger protein (C2H2 type) 276 [Mus musculus] (SEQ ID NO:80) 
3) gi|11611571|dbj|BAB19000.1| (AB052145) hypothetical protein [Macaca fascicularis] (SEQ ID NO:81) 
4) gi|14776742|ref|XP_047520.1| hypothetical protein XP_047520 [Homo sapiens] (SEQ ID NO:82) 

5) gi|11611570|dbj|BAB18999.1| (AB052145) hypothetical protein [Macaca fascicularis] (SEQ ID NO:83) 

6) gi|15559662|gb|AAH14187.1|AAH14187 (BC014187) Unknown (protein for MGC:20975) [Homo sapiens] 
(SEQ ID NO:84) 



NOVlO 1 

gi 1 10048420 I 1 

gi I 11611571 I 1 

gi 1 14776742 I 1 

gi 1 11611570 I 1 

gi 1 15559662 1 1 

NOVlO 1 

gi I 100484201 1 

gi I 11611571 I 1 

gi 114776742 1 1 

gi I 11611570 I 1 



MAERALEPEAEAEAEAGAGGEAAAEEGAAGRKARGRPMiTESDR^ 60 



1 
1 
1 
1 



gi 1 15559662 1 61 YLGEAHGPWDIJlRRSGWSDAKIAAYLISIiERGQRSGRHGKPWEQVPKKPKRK^^ 120 



NOVlO 

gi 1 100484201 
gi 1 11611571 1 
gi 114776742 I 
gi 1 11611570 ( 
git 15559662 I 

NOVlO 

gi 1 100484201 
gi 1 11611571 1 
gi 114776742 I 
gi 1 11611570 1 
gi 1 15559662 1 

NOVlO 

gi 1 10048420 1 
gi 1116115711 
gi 114776742 I 
gi 1 11611570 1 
gi 115559662 I 

NOVlO 

gi]10048420| 
gi 1 11611671 1 
gi|l4776742| 
gi 1 11611570 I 
gi 1 15559662 1 

NOVlO 

gi 1 10048420 1 
gi 1 116115711 
gi 1 147767421 
gi 111611570 I 
gi 1 155596621 

NOVlO 

gi 1 100484201 
gi 1 116115711 
gi 114776742 I 



-MDTSS SCKAF 10 

-MDTAS SCRAL 10 



MDMRPAAGPCPTFRGH 16 
121 NCLKNyVIWYEDHKHRCPYEPHLAELDPTFGLYTTAVWQCEAGHRYF ^ 



PLSDS 180 



11 
11 
1 

17 
1 



LLDSaLAVKWPWD KETAPRLPQHRGWNPGD^QTSQGRGTGTPVffl-AEOkTLP 62 

FLOgALAVKWAWG KDLSPRLAQNSESNPTC^SRLCQ-ARETQVg-SBdlCTLP 61 



CPPgriU^SRSCGAATRATTTPWIPAPAZU^PSCWTVRWQSSGHG-TKRRRHl 



-MGHCRLCHGKFSSRSLRG-ISERA] 




CPJ^EGG 74 
jSVeRP 31 



181 DPpgDKVGNGLVAGSSDSSSSGSASDSEESPEGQPVKkAAAAAAATPTSPVgSSGLITQE 240 

ID^AQPPSDSiAVGPR IZj 70 

l^DVALLHSHC^SVGPG ,^ , .-^--.^ . III 79 



63 
62 

•1 - 
75 
32 



OtLQgPLRPPRVgGQGPQ- 
SAEERVLVRDFQR-L- 



1 

91 
45 



241 GVH:^PFDVHH\^SIJ^QGTPLCSNPAGNGPEALETVVCVPVPVQVGAGPSALraW^ 300 



80 
79 
1 

92 
46 
301 

109 
108 
32 
134 

63 



FPPQPSi>LC 

PC TQPHLgpSEA 

mrpsllqtBtrwg 

IlrprpcparmwpsllrtHtrwg 

jvSvRQD 

|EWASCPMPGMVPGSqy;ilIAG 



S CfflGEKQLP^STSDD— 

■jGETQVPjS'STSDD— 
flASHBSjaACPS2vjGPQG{ 

SSGacpmgpqg[ 

— - — LSQFVCI 
DALTAEGIHLl 



109 

108 

• 32 

134 

g3 

TGS6LGEEVPCAMME 360 



copras DLS — 

CD^gSDLs — eSs: 
isrshlqprImi 

IfSSgHLQPRMIgS 
-CHAjDHYQCHSiLiLRS 




\qssdesfefypeBkvsgkks 

gSSDESFEPYPESfCVSGKKS 
SSDESFEPYPE gKVSGKKS 
QSSDESFEPYPE SKVSGKKs! 



155 
154 
80 
182 



361 GVAAYTQTEPEGSQPSTMDATA\^GlB^ JJo 




glRKKPGPKPGWKKKLRCEREELPTIYKCPYQGCrAVYRG 
SiRKKPGPKPGWPCKKLRCEREELPTIYKCPYQGCTAVYRG 
glRKKPGPKPGWKKKLRCEREELPTIYKCPYQGCTAVYRG 
llRKKPGPKPGWKKKLRCEREELPTlYKCPYQGCTAVYRG 
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gi 1 15559662 I 421 PE^DGEf LDGSDMSAIIYeBpJ^EI^RrSIrS^^^ 



■hEqktlsseycg 



JigA 155 
VA 480 



NOVIO ■ 207 

giI10048420J .206 

gi I 11611571 1 132 

gi 1 14776742 I 234 «„^,»„.. «^aK..»»is>^ 



ADGHpaiIE2:HIIE2VRERFCFHPGCWir^raiDP//L^^^^^^ 
fiD.--hK^mfaiIlEEVRERPCPHrGCMK^ 



266 

265 
191 

^ 293 

Pjf?DiCETAPiR 195 

■ 539 



NOVIO 267 

g± 1 10048420 1 266 

gi 1 11611571 1 192 

gi 1 14776742 I 294 

gi 1 11611570 I 195 - 

gi 1 15559662 I 540 




•7VETELDFACDQCGR 



iSDAVGPRSGE 



326 
325 
251 
353 
252 
599 



NOVIO 

gi 1 10048420 I 
gi 1 11611571 1 
gif 147767421 
gi 111611570 1 . 
gi I 15559662 | 



327 
326 
252 
354 

253 (S||P1,CGJ^PGQ: 
600 ISVKBfrLKSfflDJ 



••HJJVll^]yMVi^^ffl1xSy;;aA.|,PI,I.•Al•.Fpp^;t 

:.JLNVIiMSIWiir-gTcSv!:jALFLE/-iEPPPGt 
■JLHVHMSirvrHrgToiSQL-B^iLPLEAEPPr-nr 



iQVPSSTSDjqRRRLE 
PT — 




Table 10E lists the domain description from DOMAIN analysis results against 
NOV10. This indicates that the NOV10 sequence has properties similar to those of other 
proteins known to contain these domains. 



Table 10E. Domain Analysis of NOV10 

gnl|Pfam|pfam00096, zf-C2H2, Zinc finger, C2H2 type. The C2H2 zinc 
finger is the classical zinc finger domain. The two conserved 
cysteines and histidines co-ordinate a zinc ion. The following pattern 
describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], 
where X can be any amino acid, and numbers in brackets indicate the 
number of residues. The positions marked # are those that are 
important for the stable fold of the zinc finger. The final position 
can be either his or cys. The C2H2 zinc finger is composed of two 
short beta strands followed by an alpha helix. The amino terminal part 
of the helix binds the major groove in DNA binding zinc fingers. 
(SEQ ID NO: [illegible]) 

Length = 23 residues, 100.0% aligned 
Score = 35.8 bits (81), Expect = 0.004 



NOV10  255 YICDECGQTFKQRKHLLVHQMRH 277 
           I 1+1 ++ +1 I I 
00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23 
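
The zinc finger pattern quoted in the domain description, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be expressed as a regular expression and applied to the NOV10 segment aligned in Table 10E. The sketch below is a simplification under stated assumptions: the positions marked # are treated as unconstrained (any residue), since the description does not say which residues are allowed there, and the names `C2H2_PATTERN` and `find_c2h2` are hypothetical. A regular-expression hit is only a motif match, not a substitute for the profile-based DOMAIN analysis reported in the table.

```python
import re

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# Assumption: '#' and 'X' are both written as '.', i.e. any residue.
C2H2_PATTERN = re.compile(
    r"."        # '#'  fold-stabilising position
    r"."        # X
    r"C"        # first zinc-coordinating cysteine
    r".{1,5}"   # X(1-5)
    r"C"        # second zinc-coordinating cysteine
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"        # first zinc-coordinating histidine
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final histidine or cysteine
)


def find_c2h2(sequence: str) -> list[tuple[int, int, str]]:
    """Return (start, end, motif) for each non-overlapping match, 1-based."""
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_PATTERN.finditer(sequence.upper())
    ]


# NOV10 residues 255-277 as shown in the Table 10E alignment.
nov10_segment = "YICDECGQTFKQRKHLLVHQMRH"
print(find_c2h2(nov10_segment))
```

The NOV10 segment satisfies the pattern over its full 23 residues, with cysteines at motif positions 3 and 6 and histidines at positions 19 and 23. Replacing either cysteine removes the match, which is a quick way to confirm that the expression is testing the zinc-coordinating residues.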



The protein similarity information, expression pattern, and map location for the 
NOV10 suggest that NOV10 may have important structural and/or physiological functions 
characteristic of the zinc finger protein 276 C2H2 protein family. Therefore, the NOV10 
nucleic acids and proteins of the invention are useful in potential therapeutic applications 
implicated in various diseases and disorders described below and/or other pathologies. For 
example, the NOV10 compositions of the present invention will have efficacy for treatment of 
patients suffering from cancer, trauma, immunological disease, respiratory disease, heart 
disease, gastro-intestinal diseases, reproductive health, neurological and neurodegenerative 
diseases, bone marrow transplantation, metabolic and endocrine diseases, allergy and 
inflammation, nephrological disorders, hematopoietic disorders and urinary system disorders. 
The NOV10 nucleic acid encoding zinc finger protein 276 C2H2 type-like protein, and the 
zinc finger protein 276 C2H2 type-like protein of the invention, or fragments thereof, may 
further be useful in diagnostic applications, wherein the presence or amount of the nucleic acid 
or the protein are to be assessed. 

NOV11

NOV11 includes two novel Thymosin beta-10-like proteins disclosed below. The
disclosed proteins have been named NOV11a and NOV11b.

NOV11a

A disclosed NOV11a nucleic acid of 129 nucleotides (also referred to as
GMAC079400_A) encoding a novel Thymosin beta-10-like protein is shown in Table 11A.
An open reading frame was identified beginning with an ATG initiation codon at nucleotides
28-30 and ending with a TAA codon at nucleotides 157-159. Putative untranslated regions
upstream from the initiation codon and downstream from the termination codon are underlined
in Table 11A, and the start and stop codons are in bold letters.



Table 11A. NOV11a Nucleotide Sequence (SEQ ID NO:31)

ACGGATGGTACCGATTGTTTTAAGAAAATGGCAGACAAACCAGACGTGGGGGGAATCGCCAGCTTCAATA
GGGCCAAGCTGAAGAAAACGGAGACGCAGGAGAAGAACACCCTGCCGACCAAAGAGACCACTGGGCAGAA
GCGGAGTGAAATTTCCTAAGAGCCCGGAGGATTTCCTGCCCTCGTC...



The disclosed NOV11a nucleic acid sequence has 172 of 190 bases (90%) identical to
a Homo sapiens Thymosin beta-10 mRNA (GENBANK-ID: S54005) (E = 3.1e-...).

A disclosed NOV11a polypeptide (SEQ ID NO:32) encoded by SEQ ID NO:31 is 43
amino acid residues and is presented using the one-letter amino acid code in Table 11B.
SignalP, Psort and/or Hydropathy results predict that NOV11a does not contain a signal
peptide and is likely to be localized to the nucleus with a certainty of 0.5426.



Table 11B. Encoded NOV11a protein sequence (SEQ ID NO:32).

MADKPDVGGIASFNRAKLKKTETQEKNTLPTKETTGQKRSEIS
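
The open reading frame coordinates quoted for NOV11a can be checked by translating the nucleotide sequence directly. The following is a minimal sketch, not part of the original disclosure: it assumes the first 159 nucleotides of SEQ ID NO:31 as they can be read from Tables 11A and 11E, uses the standard genetic code, and introduces a hypothetical helper named `translate_orf`.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: nucleotides 1-159 of NOV11a as read from Tables 11A and 11E.
NOV11A_5PRIME = (
    "ACGGATGGTACCGATTGTTTTAAGAAAATGGCAGACAAACCAGACGTGGGGGGAATCGCCAGCTTCAATA"
    "GGGCCAAGCTGAAGAAAACGGAGACGCAGGAGAAGAACACCCTGCCGACCAAAGAGACCACTGGGCAGAA"
    "GCGGAGTGAAATTTCCTAA"
)


def translate_orf(seq, start):
    """Translate from a 1-based start position to the first in-frame stop codon.

    Returns (protein, stop_first, stop_last) with 1-based stop coordinates,
    or (protein, None, None) if the sequence ends before a stop codon.
    """
    protein = []
    for pos in range(start - 1, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            return "".join(protein), pos + 1, pos + 3
        protein.append(residue)
    return "".join(protein), None, None


protein, stop_first, stop_last = translate_orf(NOV11A_5PRIME, 28)
print(protein, len(protein), stop_first, stop_last)
```

Dropping the first nucleotide and starting at position 27 gives the same protein, which is consistent with the coordinates quoted for NOV11b.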



The NOV11a amino acid sequence has 37 of 44 amino acid residues (84%) identical
to, and 40 of 44 amino acid residues (90%) similar to, the Rattus norvegicus 44 amino acid
residue Thymosin beta-10 protein (A27266) (E = 2.4e-...). The global sequence homology is
88.372% amino acid homology and 86.047% amino acid identity.

NOV11a is predicted to be expressed in the Metastatic Melanoma tissues because of
the expression pattern of a closely related Homo sapiens Thymosin beta-10 homolog
(GENBANK-ID: S54005).

NOV11b

A disclosed NOV11b nucleic acid of 173 nucleotides (also referred to as CG109754-01)
encoding a novel Thymosin beta-10-like protein is shown in Table 11C. An open reading
frame was identified beginning with an ATG initiation codon at nucleotides 27-29 and ending
with a TAA codon at nucleotides 156-158. Putative untranslated regions upstream from the
initiation codon and downstream from the termination codon are underlined in Table 11C, and
the start and stop codons are in bold letters.



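Table 11C. NOV11b Nucleotide Sequence (SEQ ID NO:33)

CGGATGGTACCGATTGTTTTAAGAAAATGGCAGACAAACCAGACGTGGGGGGAATCGCCAGCTTCAATAG
GGCCAAGCTGAAGAAAACGGAGACGCAGGAGAAGAACACCCTGCCGACCAAAGAGACCACTGGGCAGAAG
CGGAGTGAAATTTCCTAAGAGCCCGGAGGATTT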



The disclosed NOV11b nucleic acid sequence, localized to chromosome 2, has 155 of
168 bases (92%) identical to a Homo sapiens Thymosin beta-10 mRNA (gb:GENBANK-
ID:HUMTHMBX|acc:M92381.1) (E = 4.1e-2...).

A disclosed NOV11b polypeptide (SEQ ID NO:34) encoded by SEQ ID NO:33 is 43
amino acid residues and is presented using the one-letter amino acid code in Table 11D.
SignalP, Psort and/or Hydropathy results predict that NOV11b does not contain a signal
peptide and is likely to be localized to the nucleus with a certainty of 0.5426. Although
PSORT suggests the NOV11b polypeptide may be localized in the nucleus, the NOV11b
protein is similar to the Thymosin family, some members of which are released extracellularly.
Therefore it is likely that this novel Thymosin beta-10-like protein is localized to the
extracellular space.




The NOV11b amino acid sequence has 36 of 43 amino acid residues (83%) identical
to, and 39 of 43 amino acid residues (90%) similar to, the 43 amino acid
residue Thymosin beta-10 protein (ptnr:SWISSNEW-ACC:P13472) (E = 1.7e-...). NOV11b
protein is 43 amino acids long, which is the same length as public protein P13472. NOV11b
protein differs at eight amino acid positions. NOV11b begins with a methionine that the public
Genbank submission is lacking. In addition to this, there are six single amino acid changes
(M6V, E8G, D14N, K15R, D4T, E35G) and a single amino acid deletion (E37-). This number
of changes in such a short peptide indicates that NOV11b protein is derived from a different
gene than the public protein.

NOV11b is predicted to be expressed in brain and neuroblastoma tissues because of the
expression pattern of a closely related Homo sapiens Thymosin beta-10 homolog
(GENBANK-ID: gb:GENBANK-ID:HUMTHMBX|acc:M92381.1).

NOV11a and NOV11b are very closely homologous as is shown in the nucleic acid
alignment in Table 11E.

Table 11E. Nucleic Acid Alignment of NOV11a and NOV11b

               10        20        30        40        50
NOV11A ACGGATGGTACCGATTGTTTTAAGAAAATGGCAGACAAACCAGACGTGGG
NOV11B -CGGATGGTACCGATTGTTTTAAGAAAATGGCAGACAAACCAGACGTGGG

               60        70        80        90       100
NOV11A GGGAATCGCCAGCTTCAATAGGGCCAAGCTGAAGAAAACGGAGACGCAGG
NOV11B GGGAATCGCCAGCTTCAATAGGGCCAAGCTGAAGAAAACGGAGACGCAGG

              110       120       130       140       150
NOV11A AGAAGAACACCCTGCCGACCAAAGAGACCACTGGGCAGAAGCGGAGTGAA
NOV11B AGAAGAACACCCTGCCGACCAAAGAGACCACTGGGCAGAAGCGGAGTGAA

              160       170       180
NOV11A ATTTCCTAAGAGCCCGGAGGATTTCCTGCCCTCGTC
NOV11B ATTTCCTAAGAGCCCGGAGGATTT------------



Homologies to any of the above NOV11 proteins will be shared by the other NOV11
proteins insofar as they are homologous to each other as shown above. Any reference to
NOV11 is assumed to refer to both of the NOV11 proteins in general, unless otherwise noted.

NOV11a also has homology to the amino acid sequences shown in the BLASTP data
listed in Table 11F.



Table 11F. BLAST results for NOV11a

Gene Index/Identifier              Protein/Organism                                          Length (aa)  Identity (%)  Positives (%)  Expect
gi|339697|gb|AAA36746.1| (M92383)  thymosin beta-10 [Homo sapiens]                           49           37/44 (84%)   40/44 (90%)    4e-04
gi|10863895|ref|NP_066926.1|       thymosin, beta 10 [Homo sapiens]                          42           34/42 (80%)   39/42 (91%)    0.002
gi|223789|prf||0912169A            thymosin beta10, Arg [Oryctolagus cuniculus]              44           37/44 (84%)   40/44 (90%)    0.005
gi|2143995|pir||I52084             thymosin beta-4 precursor (fragment) [Rattus norvegicus]  43           36/43 (83%)   39/43 (89%)    0.019



The homology of these sequences is shown graphically in the ClustalW analysis shown
in Table 11G.

Table 11G. Information for the ClustalW proteins

1) NOV11a (SEQ ID NO:32)
2) gi|339697|gb|AAA36746.1| (M92383) thymosin beta-10 [Homo sapiens] (SEQ ID NO:85)
3) gi|10863895|ref|NP_066926.1| thymosin, beta 10 [Homo sapiens] (SEQ ID NO:86)
4) gi|223789|prf||0912169A thymosin beta10, Arg [Oryctolagus cuniculus] (SEQ ID NO:87)
5) gi|2143995|pir||I52084 thymosin beta-4 precursor (fragment) [Rattus norvegicus] (SEQ ID NO:88)



NOVlla 1 

gi I 339697 I 1 

gi 1 10863895 1 1 

gi 12237891 1 

gl 12143995 1 1 



-DCFKK 



LFAQLAQLLPAT 



MADKPDaGglASFj^UjJAKLKKTETQEKNTLPTKET 
MADKPDMGEIASFDKAKLKKTETQEKNTLPTKETIEQEKRSEIS 

madkpdmgeiasfdkaklkktetqekntlptketieqekrseis 
Badkpdmgeiasfdkaklkktetqekntlptecetieqekrseis 

MgPKPDMBEltgSFDKgKLKKTETQEKNBLPBKETIEQEKlg^S 



The amino acid sequence of NOV11 has high homology to other proteins as shown in
Table 11H.



Table 11H. BLASTX results for NOV11

Sequences producing High-scoring Segment Pairs                         Reading Frame  High Score  Smallest Sum Prob P(N)
patp:AAY80267 Thymosin beta 4 peptide isoform Tbeta10, Unknown, 43 aa  +1             169         7.2e-12



The protein similarity information, expression pattern, and map location for the
NOV11 protein and nucleic acid suggest that NOV11 may have important structural and/or
physiological functions characteristic of the Thymosin beta 10 family. Therefore, the NOV11
nucleic acids and proteins of the invention are useful in potential therapeutic applications
implicated in various diseases and disorders described below and/or other pathologies. For
example, the NOV11 compositions of the present invention will have efficacy for treatment of
patients suffering from prostate cancer, immunological and autoimmune disorders (i.e.,
hyperthyroidism), angiogenesis and wound healing, modulation of apoptosis,
neurodegenerative and neuropsychiatric disorders, age-related disorders, pathological
disorders involving spleen, thymus, lung, and peritoneal macrophages and/or other pathologies
and disorders. The NOV11 nucleic acid encoding Thymosin beta 10-like protein, and the
Thymosin beta 10-like protein of the invention, or fragments thereof, may further be useful in
diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are
to be assessed.

NOVX Nucleic Acids and Polypeptides 

One aspect of the invention pertains to isolated nucleic acid molecules that encode
NOVX polypeptides or biologically active portions thereof. Also included in the invention are
nucleic acid fragments sufficient for use as hybridization probes to identify NOVX-encoding
nucleic acids (e.g., NOVX mRNAs) and fragments for use as PCR primers for the
amplification and/or mutation of NOVX nucleic acid molecules. As used herein, the term
"nucleic acid molecule" is intended to include DNA molecules (e.g., cDNA or genomic
DNA), RNA molecules (e.g., mRNA), analogs of the DNA or RNA generated using
nucleotide analogs, and derivatives, fragments and homologs thereof. The nucleic acid
molecule may be single-stranded or double-stranded, but preferably is comprised of
double-stranded DNA.

An NOVX nucleic acid can encode a mature NOVX polypeptide. As used herein, a 
"mature" form of a polypeptide or protein disclosed in the present invention is the product of a
naturally occurring polypeptide or precursor form or proprotein. The naturally occurring
polypeptide, precursor or proprotein includes, by way of nonlimiting example, the full-length
gene product, encoded by the corresponding gene. Alternatively, it may be defined as the
polypeptide, precursor or proprotein encoded by an ORF described herein. The product
"mature" form arises, again by way of nonlimiting example, as a result of one or more
naturally occurring processing steps as they may take place within the cell, or host cell, in
which the gene product arises. Examples of such processing steps leading to a "mature" form
of a polypeptide or protein include the cleavage of the N-terminal methionine residue encoded
by the initiation codon of an ORF, or the proteolytic cleavage of a signal peptide or leader
sequence. Thus a mature form arising from a precursor polypeptide or protein that has
residues 1 to N, where residue 1 is the N-terminal methionine, would have residues 2 through
N remaining after removal of the N-terminal methionine. Alternatively, a mature form arising
from a precursor polypeptide or protein having residues 1 to N, in which an N-terminal signal 
sequence from residue 1 to residue M is cleaved, would have the residues from residue M+1 to 
residue N remaining. Further as used herein, a "mature" form of a polypeptide or protein may 
arise from a step of post-translational modification other than a proteolytic cleavage event. 
Such additional processes include, by way of non-limiting example, glycosylation, 
myristoylation or phosphorylation. In general, a mature polypeptide or protein may result 
from the operation of only one of these processes, or a combination of any of them. 
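
The residue bookkeeping in the preceding paragraph (residues 2 through N after removal of the N-terminal methionine, residues M+1 through N after cleavage of a signal sequence spanning residues 1 to M) can be sketched as below. The function names and the 16-residue precursor are hypothetical and serve only as illustration; no particular signal peptide length is implied for any NOVX protein.

```python
def mature_after_met_removal(precursor):
    """Return residues 2..N, i.e. the precursor without its N-terminal methionine."""
    if not precursor.startswith("M"):
        raise ValueError("precursor does not begin with methionine")
    return precursor[1:]


def mature_after_signal_cleavage(precursor, m):
    """Return residues M+1..N after cleavage of a signal sequence spanning 1..M."""
    if not 0 < m < len(precursor):
        raise ValueError("M must fall inside the precursor")
    return precursor[m:]


# Hypothetical 16-residue precursor with an assumed 12-residue signal sequence.
toy_precursor = "MKALLVLSLAQAEETV"
print(mature_after_met_removal(toy_precursor))
print(mature_after_signal_cleavage(toy_precursor, 12))
```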

The term "probes", as utilized herein, refers to nucleic acid sequences of variable
length, preferably between at least about 10 nucleotides (nt), 100 nt, or as many as
approximately, e.g., 6,000 nt, depending upon the specific use. Probes are used in the
detection of identical, similar, or complementary nucleic acid sequences. Longer length
probes are generally obtained from a natural or recombinant source, are highly specific, and
much slower to hybridize than shorter-length oligomer probes. Probes may be single- or
double-stranded and designed to have specificity in PCR, membrane-based hybridization
technologies, or ELISA-like technologies.

The term "isolated" nucleic acid molecule, as utilized herein, is one which is separated
from other nucleic acid molecules which are present in the natural source of the nucleic acid.
Preferably, an "isolated" nucleic acid is free of sequences which naturally flank the nucleic
acid (i.e., sequences located at the 5'- and 3'-termini of the nucleic acid) in the genomic DNA
of the organism from which the nucleic acid is derived. For example, in various embodiments,
the isolated NOVX nucleic acid molecules can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1
kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in
genomic DNA of the cell/tissue from which the nucleic acid is derived (e.g., brain, heart, liver,
spleen, etc.). Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can
be substantially free of other cellular material or culture medium when produced by
recombinant techniques, or of chemical precursors or other chemicals when chemically
synthesized.

A nucleic acid molecule of the invention, e.g., a nucleic acid molecule having the
nucleotide sequence of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and
33, or a complement of this aforementioned nucleotide sequence, can be isolated using
standard molecular biology techniques and the sequence information provided herein. Using
all or a portion of the nucleic acid sequence of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19,
21, 23, 25, 27, 29, 31 and 33 as a hybridization probe, NOVX molecules can be isolated using
standard hybridization and cloning techniques (e.g., as described in Sambrook, et al., (eds.),
MOLECULAR CLONING: A LABORATORY MANUAL 2nd Ed., Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, NY, 1989; and Ausubel, et al., (eds.), CURRENT PROTOCOLS IN
MOLECULAR BIOLOGY, John Wiley & Sons, New York, NY, 1993.)

A nucleic acid of the invention can be amplified using cDNA, mRNA or alternatively,
genomic DNA, as a template and appropriate oligonucleotide primers according to standard
PCR amplification techniques. The nucleic acid so amplified can be cloned into an
appropriate vector and characterized by DNA sequence analysis. Furthermore,
oligonucleotides corresponding to NOVX nucleotide sequences can be prepared by standard
synthetic techniques, e.g., using an automated DNA synthesizer.

As used herein, the term "oligonucleotide" refers to a series of linked nucleotide
residues, which oligonucleotide has a sufficient number of nucleotide bases to be used in a
PCR reaction. A short oligonucleotide sequence may be based on, or designed from, a
genomic or cDNA sequence and is used to amplify, confirm, or reveal the presence of an
identical, similar or complementary DNA or RNA in a particular cell or tissue.
Oligonucleotides comprise portions of a nucleic acid sequence having about 10 nt, 50 nt, or
100 nt in length, preferably about 15 nt to 30 nt in length. In one embodiment of the
invention, an oligonucleotide comprising a nucleic acid molecule less than 100 nt in length
would further comprise at least 6 contiguous nucleotides of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15,
17, 19, 21, 23, 25, 27, 29, 31 and 33, or a complement thereof. Oligonucleotides may be
chemically synthesized and may also be used as probes.
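
Whether an oligonucleotide contains at least 6 contiguous nucleotides of a reference sequence can be tested by comparing k-mers. The sketch below is illustrative only: the function name is hypothetical, the reference is a short stretch taken from the start of the NOV11a open reading frame, and the complement strand would need to be checked separately.

```python
def shares_contiguous_run(oligo, reference, k=6):
    """Return True if oligo contains at least k contiguous nucleotides of reference."""
    reference_kmers = {reference[i:i + k] for i in range(len(reference) - k + 1)}
    return any(oligo[i:i + k] in reference_kmers for i in range(len(oligo) - k + 1))


# Assumed reference: the first 21 nucleotides of the NOV11a open reading frame.
reference = "ATGGCAGACAAACCAGACGTG"
print(shares_contiguous_run("TTTGACAAATTT", reference))  # shares the run GACAAA
print(shares_contiguous_run("TTTTTTTTTTTT", reference))  # shares no 6-nt run
```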

In another embodiment, an isolated nucleic acid molecule of the invention comprises a
nucleic acid molecule that is a complement of the nucleotide sequence shown in SEQ ID
NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and 33, or a portion of this
nucleotide sequence (e.g., a fragment that can be used as a probe or primer or a fragment
encoding a biologically-active portion of an NOVX polypeptide). A nucleic acid molecule
that is complementary to the nucleotide sequence shown in SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19,
21, 23, 25, 27, 29, 31 or 33 is one that is sufficiently complementary to the nucleotide
sequence shown in SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 or 33 that it can
hydrogen bond with little or no mismatches to the nucleotide sequence shown in SEQ ID NOS:1,
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and 33, thereby forming a stable duplex.
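
The notion of a complement that pairs "with little or no mismatches" can be made concrete with a small sketch. It assumes plain Watson-Crick pairing on DNA letters (A, C, G, T) and antiparallel strands; the function names and the 9-nt example are hypothetical, and Hoogsteen pairing is not modeled.

```python
# Watson-Crick partner of each DNA base.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the strand that pairs perfectly with seq, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def count_mismatches(strand, candidate):
    """Count positions where candidate fails to pair with strand in an antiparallel duplex."""
    if len(strand) != len(candidate):
        raise ValueError("strands must be the same length")
    perfect = reverse_complement(strand)
    return sum(a != b for a, b in zip(perfect, candidate))


strand = "ATGGCAGAC"
print(reverse_complement(strand))
print(count_mismatches(strand, "GTCTGACAT"))
```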

As used herein, the term "complementary" refers to Watson-Crick or Hoogsteen base
pairing between nucleotide units of a nucleic acid molecule, and the term "binding" means
the physical or chemical interaction between two polypeptides or compounds or associated
polypeptides or compounds or combinations thereof. Binding includes ionic, non-ionic, van
der Waals, hydrophobic interactions, and the like. A physical interaction can be either direct
or indirect. Indirect interactions may be through or due to the effects of another polypeptide or
compound. Direct binding refers to interactions that do not take place through, or due to, the
effect of another polypeptide or compound, but instead are without other substantial chemical
intermediates.

Fragments provided herein are defined as sequences of at least 6 (contiguous) nucleic
acids or at least 4 (contiguous) amino acids, a length sufficient to allow for specific
hybridization in the case of nucleic acids or for specific recognition of an epitope in the case of
amino acids, respectively, and are at most some portion less than a full length sequence.
Fragments may be derived from any contiguous portion of a nucleic acid or amino acid
sequence of choice. Derivatives are nucleic acid sequences or amino acid sequences formed
from the native compounds either directly or by modification or partial substitution. Analogs
are nucleic acid sequences or amino acid sequences that have a structure similar to, but not
identical to, the native compound but differ from it in respect to certain components or side
chains. Analogs may be synthetic or from a different evolutionary origin and may have a
similar or opposite metabolic activity compared to wild type. Homologs are nucleic acid
sequences or amino acid sequences of a particular gene that are derived from different species.

Derivatives and analogs may be full length or other than full length, if the derivative or
analog contains a modified nucleic acid or amino acid, as described below. Derivatives or
analogs of the nucleic acids or proteins of the invention include, but are not limited to,
molecules comprising regions that are substantially homologous to the nucleic acids or
proteins of the invention, in various embodiments, by at least about 70%, 80%, or 95%
identity (with a preferred identity of 80-95%) over a nucleic acid or amino acid sequence of
identical size or when compared to an aligned sequence in which the alignment is done by a
computer homology program known in the art, or whose encoding nucleic acid is capable of
hybridizing to the complement of a sequence encoding the aforementioned proteins under
stringent, moderately stringent, or low stringent conditions. See e.g. Ausubel, et al., Current
Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 1993, and below.

A "homologous nucleic acid sequence" or "homologous amino acid sequence," or
variations thereof, refer to sequences characterized by a homology at the nucleotide level or
amino acid level as discussed above. Homologous nucleotide sequences encode those
sequences coding for isoforms of NOVX polypeptides. Isoforms can be expressed in different
tissues of the same organism as a result of, for example, alternative splicing of RNA.
Alternatively, isoforms can be encoded by different genes. In the invention, homologous
nucleotide sequences include nucleotide sequences encoding for an NOVX polypeptide of
species other than humans, including, but not limited to: vertebrates, and thus can include, e.g.,
frog, mouse, rat, rabbit, dog, cat, cow, horse, and other organisms. Homologous nucleotide
sequences also include, but are not limited to, naturally occurring allelic variations and
mutations of the nucleotide sequences set forth herein. A homologous nucleotide sequence
does not, however, include the exact nucleotide sequence encoding human NOVX protein.
Homologous nucleic acid sequences include those nucleic acid sequences that encode
conservative amino acid substitutions (see below) in SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17,
19, 21, 23, 25, 27, 29, 31 and 33, as well as a polypeptide possessing NOVX biological
activity. Various biological activities of the NOVX proteins are described below.
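
The identity thresholds named above (at least about 70%, 80%, or 95% over sequences of identical size) can be illustrated with a simple calculation. The sketch assumes the two sequences are already aligned to equal length, with "-" marking gaps; the function name is hypothetical, and the example compares the first 20 residues of NOV11a with the corresponding residues of the human thymosin beta-10 entry shown in the ClustalW analysis.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of aligned columns holding the same residue; gap columns never match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)


nov11a_first20 = "MADKPDVGGIASFNRAKLKK"
human_tb10_first20 = "MADKPDMGEIASFDKAKLKK"
identity = percent_identity(nov11a_first20, human_tb10_first20)
for threshold in (70, 80, 95):
    print(threshold, identity >= threshold)
```

A production comparison would use an alignment program rather than this fixed-column count, as the text itself notes.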

An NOVX polypeptide is encoded by the open reading frame ("ORF") of an NOVX
nucleic acid. An ORF corresponds to a nucleotide sequence that could potentially be translated
into a polypeptide. A stretch of nucleic acids comprising an ORF is uninterrupted by a stop
codon. An ORF that represents the coding sequence for a full protein begins with an ATG
"start" codon and terminates with one of the three "stop" codons, namely, TAA, TAG, or
TGA. For the purposes of this invention, an ORF may be any part of a coding sequence, with
or without a start codon, a stop codon, or both. For an ORF to be considered as a good
candidate for coding for a bona fide cellular protein, a minimum size requirement is often set,
e.g., a stretch of DNA that would encode a protein of 50 amino acids or more.
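
The ORF definition above (an ATG start, no intervening stop codon, termination at TAA, TAG or TGA, and an optional minimum size) can be sketched as a forward-strand scan. The function name and the return format are hypothetical, and the test sequence is the assumed first 159 nucleotides of NOV11a as read from Tables 11A and 11E. Under these assumptions the 43-codon NOV11a ORF is found when the threshold is lowered to 40 codons, while the 50-codon example threshold from the text excludes it.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_codons=50):
    """List forward-strand ORFs as (first_nt, last_nt, codons), 1-based, stop included.

    codons counts the coding codons only (the stop codon is not counted).
    """
    orfs = []
    for frame in range(3):
        start = None
        for pos in range(frame, len(seq) - 2, 3):
            codon = seq[pos:pos + 3]
            if start is None and codon == "ATG":
                start = pos
            elif start is not None and codon in STOP_CODONS:
                codons = (pos - start) // 3
                if codons >= min_codons:
                    orfs.append((start + 1, pos + 3, codons))
                start = None
    return orfs


# Assumption: nucleotides 1-159 of NOV11a as read from Tables 11A and 11E.
nov11a_5prime = (
    "ACGGATGGTACCGATTGTTTTAAGAAAATGGCAGACAAACCAGACGTGGGGGGAATCGCCAGCTTCAATA"
    "GGGCCAAGCTGAAGAAAACGGAGACGCAGGAGAAGAACACCCTGCCGACCAAAGAGACCACTGGGCAGAA"
    "GCGGAGTGAAATTTCCTAA"
)
print(find_orfs(nov11a_5prime, min_codons=40))
print(find_orfs(nov11a_5prime))
```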

The nucleotide sequences determined from the cloning of the human NOVX genes
allow for the generation of probes and primers designed for use in identifying and/or cloning
NOVX homologues in other cell types, e.g. from other tissues, as well as NOVX homologues
from other vertebrates. The probe/primer typically comprises substantially purified
oligonucleotide. The oligonucleotide typically comprises a region of nucleotide sequence that
hybridizes under stringent conditions to at least about 12, 25, 50, 100, 150, 200, 250, 300, 350
or 400 consecutive sense strand nucleotide sequence of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17,
19, 21, 23, 25, 27, 29, 31 and 33; or an anti-sense strand nucleotide sequence of SEQ ID
NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and 33; or of a naturally occurring
mutant of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and 33.

Probes based on the human NOVX nucleotide sequences can be used to detect
transcripts or genomic sequences encoding the same or homologous proteins. In various
embodiments, the probe further comprises a label group attached thereto, e.g. the label group
can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such
probes can be used as a part of a diagnostic test kit for identifying cells or tissues which
mis-express an NOVX protein, such as by measuring a level of an NOVX-encoding nucleic acid in
a sample of cells from a subject, e.g., detecting NOVX mRNA levels or determining whether a
genomic NOVX gene has been mutated or deleted.

"A polypeptide having a biologically-active portion of an NOVX polypeptide" refers
to polypeptides exhibiting activity similar, but not necessarily identical to, an activity of a
polypeptide of the invention, including mature forms, as measured in a particular biological
assay, with or without dose dependency. A nucleic acid fragment encoding a
"biologically-active portion of NOVX" can be prepared by isolating a portion of SEQ ID NOS:1, 3, 5, 7, 9, 11,
13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and 33 that encodes a polypeptide having an NOVX
biological activity (the biological activities of the NOVX proteins are described below),
expressing the encoded portion of NOVX protein (e.g., by recombinant expression in vitro)
and assessing the activity of the encoded portion of NOVX.

NOVX Nucleic Acid and Polypeptide Variants 

The invention further encompasses nucleic acid molecules that differ from the
nucleotide sequences shown in SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27,
29, 31 and 33 due to degeneracy of the genetic code and thus encode the same NOVX proteins
as that encoded by the nucleotide sequences shown in SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15,
17, 19, 21, 23, 25, 27, 29, 31 and 33. In another embodiment, an isolated nucleic acid
molecule of the invention has a nucleotide sequence encoding a protein having an amino acid
sequence shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and
34.
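
Degeneracy of the genetic code means that different nucleotide sequences can encode the same protein. As a minimal illustration, the sketch below translates the first seven codons of the NOV11a open reading frame and a synonymous variant; the variant is made up for this example and is not a sequence disclosed in the text.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq):
    """Translate a coding sequence whose length is a multiple of three."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


original = "ATGGCAGACAAACCAGACGTG"    # first seven codons of the NOV11a ORF
synonymous = "ATGGCCGATAAGCCTGATGTC"  # hypothetical variant, third-base changes only
differences = sum(a != b for a, b in zip(original, synonymous))
print(translate(original), translate(synonymous), differences)
```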

In addition to the human NOVX nucleotide sequences shown in SEQ ID NOS:1, 3, 5,
7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and 33, it will be appreciated by those skilled in
the art that DNA sequence polymorphisms that lead to changes in the amino acid sequences of
the NOVX polypeptides may exist within a population (e.g., the human population). Such
genetic polymorphism in the NOVX genes may exist among individuals within a population
due to natural allelic variation. As used herein, the terms "gene" and "recombinant gene" refer
to nucleic acid molecules comprising an open reading frame (ORF) encoding an NOVX
protein, preferably a vertebrate NOVX protein. Such natural allelic variations can typically
result in 1-5% variance in the nucleotide sequence of the NOVX genes. Any and all such
nucleotide variations and resulting amino acid polymorphisms in the NOVX polypeptides,
which are the result of natural allelic variation and that do not alter the functional activity of
the NOVX polypeptides, are intended to be within the scope of the invention.

Moreover, nucleic acid molecules encoding NOVX proteins from other species, and
thus that have a nucleotide sequence that differs from the human SEQ ID NOS:1, 3, 5, 7, 9,
11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and 33 are intended to be within the scope of the
invention. Nucleic acid molecules corresponding to natural allelic variants and homologues of
the NOVX cDNAs of the invention can be isolated based on their homology to the human
NOVX nucleic acids disclosed herein using the human cDNAs, or a portion thereof, as a
hybridization probe according to standard hybridization techniques under stringent
hybridization conditions.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 6 nucleotides in length and hybridizes under stringent conditions to the
nucleic acid molecule comprising the nucleotide sequence of SEQ ID NOS:1, 3, 5, 7, 9, 11,
13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and 33. In another embodiment, the nucleic acid is at
least 10, 25, 50, 100, 250, 500, 750, 1000, 1500, or 2000 or more nucleotides in length. In yet
another embodiment, an isolated nucleic acid molecule of the invention hybridizes to the
coding region. As used herein, the term "hybridizes under stringent conditions" is intended to
describe conditions for hybridization and washing under which nucleotide sequences at least
60% homologous to each other typically remain hybridized to each other.

Homologs (i.e., nucleic acids encoding NOVX proteins derived from species other
than human) or other related sequences (e.g., paralogs) can be obtained by low, moderate or
high stringency hybridization with all or a portion of the particular human sequence as a probe
using methods well known in the art for nucleic acid hybridization and cloning.

As used herein, the phrase "stringent hybridization conditions" refers to conditions
under which a probe, primer or oligonucleotide will hybridize to its target sequence, but to no
other sequences. Stringent conditions are sequence-dependent and will be different in
different circumstances. Longer sequences hybridize specifically at higher temperatures than
shorter sequences. Generally, stringent conditions are selected to be about 5°C lower than the
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The
Tm is the temperature (under defined ionic strength, pH and nucleic acid concentration) at
which 50% of the probes complementary to the target sequence hybridize to the target
sequence at equilibrium. Since the target sequences are generally present at excess, at Tm,
50% of the probes are occupied at equilibrium. Typically, stringent conditions will be those in
which the salt concentration is less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M
sodium ion (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short
probes, primers or oligonucleotides (e.g., 10 nt to 50 nt) and at least about 60°C for longer
probes, primers and oligonucleotides. Stringent conditions may also be achieved with the
addition of destabilizing agents, such as formamide.
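
The text defines stringent conditions relative to Tm but gives no formula for estimating it. The sketch below is an assumption layered on top of the text: it uses the Wallace rule, Tm = 2(A+T) + 4(G+C) in degrees Celsius, a common rule of thumb for short oligonucleotides that ignores the ionic strength, pH and concentration dependence the text mentions. The function names and the 14-nt probe are hypothetical.

```python
def wallace_tm(oligo):
    """Estimate Tm in degrees C with the Wallace rule (short oligonucleotides only)."""
    oligo = oligo.upper()
    at = oligo.count("A") + oligo.count("T")
    gc = oligo.count("G") + oligo.count("C")
    return 2 * at + 4 * gc


def stringent_temperature(oligo, offset=5):
    """Temperature about `offset` degrees C below the estimated Tm, per the text."""
    return wallace_tm(oligo) - offset


probe = "ATGGCAGACAAACC"  # hypothetical 14-nt probe from the NOV11a ORF start
print(wallace_tm(probe), stringent_temperature(probe))
```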

Stringent conditions are known to those skilled in the art and can be found in Ausubel,
et al., (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, N.Y.
(1989), 6.3.1-6.3.6. Preferably, the conditions are such that sequences at least about 65%,
70%, 75%, 85%, 90%, 95%, 98%, or 99% homologous to each other typically remain
hybridized to each other. A non-limiting example of stringent hybridization conditions are
hybridization in a high salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM
EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 mg/ml denatured salmon sperm DNA
at 65°C, followed by one or more washes in 0.2X SSC, 0.01% BSA at 50°C. An isolated
nucleic acid molecule of the invention that hybridizes under stringent conditions to the
sequences SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and 33,
corresponds to a naturally-occurring nucleic acid molecule. As used herein, a
"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having a
nucleotide sequence that occurs in nature (e.g., encodes a natural protein).

In a second embodiment, a nucleic acid sequence that is hybridizable to the nucleic
acid molecule comprising the nucleotide sequence of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17,
19, 21, 23, 25, 27, 29, 31 and 33, or fragments, analogs or derivatives thereof, under
conditions of moderate stringency is provided. A non-limiting example of moderate
stringency hybridization conditions are hybridization in 6X SSC, 5X Denhardt's solution,
0.5% SDS and 100 mg/ml denatured salmon sperm DNA at 55°C, followed by one or more
washes in 1X SSC, 0.1% SDS at 37°C. Other conditions of moderate stringency that may be
used are well-known within the art. See, e.g., Ausubel, et al. (eds.), 1993, Current
Protocols in Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1990; Gene
Transfer and Expression, A Laboratory Manual, Stockton Press, NY.

In a third embodiment, a nucleic acid that is hybridizable to the nucleic acid molecule
comprising the nucleotide sequences SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25,
27, 29, 31 and 33, or fragments, analogs or derivatives thereof, under conditions of low
stringency, is provided. A non-limiting example of low stringency hybridization conditions
is hybridization in 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA,
0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 mg/ml denatured salmon sperm DNA, 10%
(wt/vol) dextran sulfate at 40°C, followed by one or more washes in 2X SSC, 25 mM Tris-HCl
(pH 7.4), 5 mM EDTA, and 0.1% SDS at 50°C. Other conditions of low stringency that may
be used are well known in the art (e.g., as employed for cross-species hybridizations). See,
e.g., Ausubel, et al. (eds.), 1993, Current Protocols in Molecular Biology, John Wiley
& Sons, NY, and Kriegler, 1990, Gene Transfer and Expression, A Laboratory
Manual, Stockton Press, NY; Shilo and Weinberg, 1981. Proc Natl Acad Sci USA 78:
6789-6792.

Conservative Mutations 

In addition to naturally-occurring allelic variants of NOVX sequences that may exist in
the population, the skilled artisan will further appreciate that changes can be introduced by
mutation into the nucleotide sequences SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23,
25, 27, 29, 31 and 33, thereby leading to changes in the amino acid sequences of the encoded
NOVX proteins, without altering the functional ability of said NOVX proteins. For example,
nucleotide substitutions leading to amino acid substitutions at "non-essential" amino acid
residues can be made in the sequence SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24,
26, 28, 30, 32 and 34. A "non-essential" amino acid residue is a residue that can be altered
from the wild-type sequences of the NOVX proteins without altering their biological activity,
whereas an "essential" amino acid residue is required for such biological activity. For
example, amino acid residues that are conserved among the NOVX proteins of the invention
are predicted to be particularly non-amenable to alteration. Amino acids for which
conservative substitutions can be made are well-known within the art.

Another aspect of the invention pertains to nucleic acid molecules encoding NOVX
proteins that contain changes in amino acid residues that are not essential for activity. Such
NOVX proteins differ in amino acid sequence from SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17,
19, 21, 23, 25, 27, 29, 31 and 33 yet retain biological activity. In one embodiment, the
isolated nucleic acid molecule comprises a nucleotide sequence encoding a protein, wherein
the protein comprises an amino acid sequence at least about 45% homologous to the amino
acid sequences SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and 34.
Preferably, the protein encoded by the nucleic acid molecule is at least about 60% homologous
to SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and 34; more
preferably at least about 70% homologous to SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22,
24, 26, 28, 30, 32 and 34; still more preferably at least about 80% homologous to SEQ ID
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and 34; even more preferably at
least about 90% homologous to SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28,
30, 32 and 34; and most preferably at least about 95% homologous to SEQ ID NOS:2, 4, 6, 8,
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and 34.

An isolated nucleic acid molecule encoding an NOVX protein homologous to the
protein of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and 34 can be
created by introducing one or more nucleotide substitutions, additions or deletions into the
nucleotide sequence of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and
33, such that one or more amino acid substitutions, additions or deletions are introduced into
the encoded protein.

Mutations can be introduced into SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23,
25, 27, 29, 31 and 33 by standard techniques, such as site-directed mutagenesis and
PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are made at
one or more predicted, non-essential amino acid residues. A "conservative amino acid
substitution" is one in which the amino acid residue is replaced with an amino acid residue
having a similar side chain. Families of amino acid residues having similar side chains have
been defined within the art. These families include amino acids with basic side chains (e.g.,
lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged
polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine),
nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine,
methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and
aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted
non-essential amino acid residue in the NOVX protein is replaced with another amino acid
residue from the same side chain family. Alternatively, in another embodiment, mutations can
be introduced randomly along all or part of an NOVX coding sequence, such as by saturation
mutagenesis, and the resultant mutants can be screened for NOVX biological activity to
identify mutants that retain activity. Following mutagenesis of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13,
15, 17, 19, 21, 23, 25, 27, 29, 31 and 33, the encoded protein can be expressed by any
recombinant technology known in the art and the activity of the protein can be determined.
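
The side chain families listed above can be expressed as a small lookup. The following is a minimal sketch only: the one-letter codes are the standard amino acid abbreviations, the family memberships are copied from the paragraph above, and the function names `shared_families` and `is_conservative` are hypothetical helpers introduced for illustration.

```python
# Side chain families as listed in the text, using standard one-letter codes.
SIDE_CHAIN_FAMILIES = {
    "basic": set("KRH"),               # lysine, arginine, histidine
    "acidic": set("DE"),               # aspartic acid, glutamic acid
    "uncharged polar": set("GNQSTYC"),
    "nonpolar": set("AVLIPFMW"),
    "beta-branched": set("TVI"),
    "aromatic": set("YFWH"),
}


def shared_families(a, b):
    """Return the names of every family that contains both residues."""
    a, b = a.upper(), b.upper()
    return [
        name
        for name, members in SIDE_CHAIN_FAMILIES.items()
        if a in members and b in members
    ]


def is_conservative(a, b):
    """True when two different residues share at least one side chain family."""
    return a.upper() != b.upper() and bool(shared_families(a, b))


if __name__ == "__main__":
    for pair in [("K", "R"), ("D", "K"), ("V", "I")]:
        print(pair, is_conservative(*pair), shared_families(*pair))
```

Because a residue can belong to more than one family (valine is both nonpolar and beta-branched), the lookup returns a list of families rather than a single label.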

The relatedness of amino acid families may also be determined based on side chain
interactions. Substituted amino acids may be fully conserved "strong" residues or fully
conserved "weak" residues. The "strong" group of conserved amino acid residues may be any
one of the following groups: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, FYW,
wherein the single letter amino acid codes are grouped by those amino acids that may be
substituted for each other. Likewise, the "weak" group of conserved residues may be any one
of the following: CSA, ATV, SAG, STNK, STPA, SGND, SNDEQK, NDEQHK, NEQHRK,
VLIM, HFY, wherein the letters within each group represent the single letter amino acid code.
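
As an illustration of how the "strong" and "weak" groups might be applied, the sketch below classifies a single substitution by checking whether both residues fall within one group. It is an assumption-laden example, not a method stated in this document: the function name `classify_substitution` and the rule that "strong" takes precedence over "weak" are choices made here for illustration.

```python
# Group spellings follow the lists in the text; "STNK" and "VLIM" are the
# assumed readings of two entries.
STRONG_GROUPS = ["STA", "NEQK", "NHQK", "NDEQ", "QHRK", "MILV", "MILF", "HY", "FYW"]
WEAK_GROUPS = [
    "CSA", "ATV", "SAG", "STNK", "STPA", "SGND",
    "SNDEQK", "NDEQHK", "NEQHRK", "VLIM", "HFY",
]


def _in_same_group(a, b, groups):
    """True when some group contains both residues."""
    return any(a in group and b in group for group in groups)


def classify_substitution(a, b):
    """Label a substitution as 'identical', 'strong', 'weak' or 'none'."""
    a, b = a.upper(), b.upper()
    if a == b:
        return "identical"
    if _in_same_group(a, b, STRONG_GROUPS):  # strong is checked first (assumption)
        return "strong"
    if _in_same_group(a, b, WEAK_GROUPS):
        return "weak"
    return "none"


if __name__ == "__main__":
    for pair in [("S", "T"), ("C", "S"), ("W", "G")]:
        print(pair, classify_substitution(*pair))
```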

In one embodiment, a mutant NOVX protein can be assayed for (i) the ability to form
protein:protein interactions with other NOVX proteins, other cell-surface proteins, or
biologically-active portions thereof; (ii) complex formation between a mutant NOVX protein
and an NOVX ligand; or (iii) the ability of a mutant NOVX protein to bind to an intracellular
target protein or biologically-active portion thereof (e.g., avidin proteins).

In yet another embodiment, a mutant NOVX protein can be assayed for the ability to
regulate a specific biological function (e.g., regulation of insulin release).

Antisense Nucleic Acids

Another aspect of the invention pertains to isolated antisense nucleic acid molecules
that are hybridizable to or complementary to the nucleic acid molecule comprising the
nucleotide sequence of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and
33, or fragments, analogs or derivatives thereof. An "antisense" nucleic acid comprises a
nucleotide sequence that is complementary to a "sense" nucleic acid encoding a protein (e.g.,
complementary to the coding strand of a double-stranded cDNA molecule or complementary
to an mRNA sequence). In specific aspects, antisense nucleic acid molecules are provided that
comprise a sequence complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides
or an entire NOVX coding strand, or to only a portion thereof. Nucleic acid molecules
encoding fragments, homologs, derivatives and analogs of an NOVX protein of SEQ ID
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and 34, or antisense nucleic acids
complementary to an NOVX nucleic acid sequence of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15,
17, 19, 21, 23, 25, 27, 29, 31 and 33, are additionally provided.

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding
region" of the coding strand of a nucleotide sequence encoding an NOVX protein. The term
"coding region" refers to the region of the nucleotide sequence comprising codons which are
translated into amino acid residues. In another embodiment, the antisense nucleic acid
molecule is antisense to a "noncoding region" of the coding strand of a nucleotide sequence
encoding the NOVX protein. The term "noncoding region" refers to 5' and 3' sequences which
flank the coding region that are not translated into amino acids (i.e., also referred to as 5' and
3' untranslated regions).

Given the coding strand sequences encoding the NOVX protein disclosed herein,
antisense nucleic acids of the invention can be designed according to the rules of Watson and
Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be complementary
to the entire coding region of NOVX mRNA, but more preferably is an oligonucleotide that is
antisense to only a portion of the coding or noncoding region of NOVX mRNA. For example,
the antisense oligonucleotide can be complementary to the region surrounding the translation
start site of NOVX mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15,
20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention
can be constructed using chemical synthesis or enzymatic ligation reactions using procedures
known in the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide)
can be chemically synthesized using naturally-occurring nucleotides or variously modified
nucleotides designed to increase the biological stability of the molecules or to increase the
physical stability of the duplex formed between the antisense and sense nucleic acids (e.g.,
phosphorothioate derivatives and acridine substituted nucleotides can be used).
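
By way of illustration, an antisense oligonucleotide designed under Watson and Crick pairing is the reverse complement of the targeted stretch of sense sequence. The sketch below is a simplified example under stated assumptions: it treats the first AUG in the input as the translation start site (which need not hold for a real mRNA), it returns an unmodified RNA sequence, and the helper names `antisense_rna` and `antisense_oligo_around_start` are hypothetical.

```python
# Watson-Crick complements for RNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGUacgu", "UGCAugca")


def antisense_rna(sense_rna):
    """Return the reverse complement of an RNA sense sequence."""
    return sense_rna.translate(_COMPLEMENT)[::-1]


def antisense_oligo_around_start(mrna, length=20):
    """Antisense oligo covering a window around the first AUG.

    Assumption: the first AUG is taken to be the translation start site.
    The window is shortened if it would run past either end of the mRNA.
    """
    mrna = mrna.upper()
    start = mrna.find("AUG")
    if start < 0:
        raise ValueError("no AUG start codon found")
    flank = (length - 3) // 2          # bases kept upstream of the AUG
    begin = max(0, start - flank)
    end = min(len(mrna), begin + length)
    return antisense_rna(mrna[begin:end])


if __name__ == "__main__":
    print(antisense_rna("AUGC"))
    print(antisense_oligo_around_start("GGGAUGCCC", length=5))
```

The default length of 20 is one of the example oligonucleotide lengths given above; any of the other listed lengths could be passed instead.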

Examples of modified nucleotides that can be used to generate the antisense nucleic
acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine,
xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine,
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine,
inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine,
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine,
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil,
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil,
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil,
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil,
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil,
3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the
antisense nucleic acid can be produced biologically using an expression vector into which a
nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest,
described further in the following subsection).

The antisense nucleic acid molecules of the invention are typically administered to a
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or
genomic DNA encoding an NOVX protein to thereby inhibit expression of the protein (e.g., by
inhibiting transcription and/or translation). The hybridization can be by conventional
nucleotide complementarity to form a stable duplex, or, for example, in the case of an
antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in
the major groove of the double helix. An example of a route of administration of antisense
nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively,
antisense nucleic acid molecules can be modified to target selected cells and then administered
systemically. For example, for systemic administration, antisense molecules can be modified
such that they specifically bind to receptors or antigens expressed on a selected cell surface
(e.g., by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell
surface receptors or antigens). The antisense nucleic acid molecules can also be delivered to
cells using the vectors described herein. To achieve sufficient nucleic acid molecules, vector
constructs in which the antisense nucleic acid molecule is placed under the control of a strong
pol II or pol III promoter are preferred.

In yet another embodiment, the antisense nucleic acid molecule of the invention is an
α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms specific
double-stranded hybrids with complementary RNA in which, contrary to the usual β-units, the
strands run parallel to each other. See, e.g., Gaultier, et al., 1987. Nucl. Acids Res. 15:
6625-6641. The antisense nucleic acid molecule can also comprise a
2'-o-methylribonucleotide (See, e.g., Inoue, et al. 1987. Nucl. Acids Res. 15: 6131-6148) or a
chimeric RNA-DNA analogue (See, e.g., Inoue, et al., 1987. FEBS Lett. 215: 327-330).

Ribozymes and PNA Moieties

Nucleic acid modifications include, by way of non-limiting example, modified bases,
and nucleic acids whose sugar phosphate backbones are modified or derivatized. These
modifications are carried out at least in part to enhance the chemical stability of the modified
nucleic acid, such that they may be used, for example, as antisense binding nucleic acids in
therapeutic applications in a subject.

In one embodiment, an antisense nucleic acid of the invention is a ribozyme.
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of
cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a
complementary region. Thus, ribozymes (e.g., hammerhead ribozymes as described in
Haselhoff and Gerlach 1988. Nature 334: 585-591) can be used to catalytically cleave NOVX
mRNA transcripts to thereby inhibit translation of NOVX mRNA. A ribozyme having
specificity for an NOVX-encoding nucleic acid can be designed based upon the nucleotide
sequence of an NOVX cDNA disclosed herein (i.e., SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17,
19, 21, 23, 25, 27, 29, 31 and 33). For example, a derivative of a Tetrahymena L-19 IVS RNA
can be constructed in which the nucleotide sequence of the active site is complementary to the
nucleotide sequence to be cleaved in an NOVX-encoding mRNA. See, e.g., U.S. Patent
4,987,071 to Cech, et al. and U.S. Patent 5,116,742 to Cech, et al. NOVX mRNA can also be
used to select a catalytic RNA having a specific ribonuclease activity from a pool of RNA
molecules. See, e.g., Bartel et al., (1993) Science 261:1411-1418.

Alternatively, NOVX gene expression can be inhibited by targeting nucleotide
sequences complementary to the regulatory region of the NOVX nucleic acid (e.g., the NOVX
promoter and/or enhancers) to form triple helical structures that prevent transcription of the
NOVX gene in target cells. See, e.g., Helene, 1991. Anticancer Drug Des. 6: 569-84; Helene,
et al. 1992. Ann. N.Y. Acad. Sci. 660: 27-36; Maher, 1992. Bioassays 14: 807-15.

In various embodiments, the NOVX nucleic acids can be modified at the base moiety,
sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or solubility
of the molecule. For example, the deoxyribose phosphate backbone of the nucleic acids can
be modified to generate peptide nucleic acids. See, e.g., Hyrup, et al., 1996. Bioorg Med
Chem 4: 5-23. As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic
acid mimics (e.g., DNA mimics) in which the deoxyribose phosphate backbone is replaced by
a pseudopeptide backbone and only the four natural nucleobases are retained. The neutral
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using
standard solid phase peptide synthesis protocols as described in Hyrup, et al., 1996. supra;
Perry-O'Keefe, et al., 1996. Proc. Natl. Acad. Sci. USA 93: 14670-14675.

PNAs of NOVX can be used in therapeutic and diagnostic applications. For example,
PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene
expression by, e.g., inducing transcription or translation arrest or inhibiting replication. PNAs
of NOVX can also be used, for example, in the analysis of single base pair mutations in a gene
(e.g., PNA directed PCR clamping); as artificial restriction enzymes when used in combination
with other enzymes, e.g., S1 nucleases (See, Hyrup, et al., 1996. supra); or as probes or primers
for DNA sequence and hybridization (See, Hyrup, et al., 1996, supra; Perry-O'Keefe, et al.,
1996. supra).

In another embodiment, PNAs of NOVX can be modified, e.g., to enhance their
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug
delivery known in the art. For example, PNA-DNA chimeras of NOVX can be generated that
may combine the advantageous properties of PNA and DNA. Such chimeras allow DNA
recognition enzymes (e.g., RNase H and DNA polymerases) to interact with the DNA portion
while the PNA portion would provide high binding affinity and specificity. PNA-DNA
chimeras can be linked using linkers of appropriate lengths selected in terms of base stacking,
number of bonds between the nucleobases, and orientation (see, Hyrup, et al., 1996. supra).
The synthesis of PNA-DNA chimeras can be performed as described in Hyrup, et al., 1996.
supra and Finn, et al., 1996. Nucl Acids Res 24: 3357-3363. For example, a DNA chain can
be synthesized on a solid support using standard phosphoramidite coupling chemistry, and
modified nucleoside analogs, e.g., 5'-(4-methoxytrityl)amino-5'-deoxy-thymidine
phosphoramidite, can be used between the PNA and the 5' end of DNA. See, e.g., Mag, et al.,
1989. Nucl Acid Res 17: 5973-5988. PNA monomers are then coupled in a stepwise manner
to produce a chimeric molecule with a 5' PNA segment and a 3' DNA segment. See, e.g.,
Finn, et al., 1996. supra. Alternatively, chimeric molecules can be synthesized with a 5' DNA
segment and a 3' PNA segment. See, e.g., Petersen, et al., 1975. Bioorg. Med. Chem. Lett. 5:
1119-11124.



In other embodiments, the oligonucleotide may include other appended groups such as
peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across
the cell membrane (see, e.g., Letsinger, et al., 1989. Proc. Natl. Acad. Sci. U.S.A. 86:
6553-6556; Lemaitre, et al., 1987. Proc. Natl. Acad. Sci. 84: 648-652; PCT Publication No.
WO88/09810) or the blood-brain barrier (see, e.g., PCT Publication No. WO 89/10134). In
addition, oligonucleotides can be modified with hybridization triggered cleavage agents (see,
e.g., Krol, et al., 1988. BioTechniques 6:958-976) or intercalating agents (see, e.g., Zon, 1988.
Pharm. Res. 5: 539-549). To this end, the oligonucleotide may be conjugated to another
molecule, e.g., a peptide, a hybridization triggered cross-linking agent, a transport agent, a
hybridization-triggered cleavage agent, and the like.

NOVX Polypeptides

A polypeptide according to the invention includes a polypeptide including the amino
acid sequence of NOVX polypeptides whose sequences are provided in SEQ ID NOS:2, 4, 6,
8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and 34. The invention also includes a mutant or
variant protein any of whose residues may be changed from the corresponding residues shown
in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and 34 while still
encoding a protein that maintains its NOVX activities and physiological functions, or a
functional fragment thereof.

In general, an NOVX variant that preserves NOVX-like function includes any variant
in which residues at a particular position in the sequence have been substituted by other amino
acids, and further include the possibility of inserting an additional residue or residues between
two residues of the parent protein as well as the possibility of deleting one or more residues
from the parent sequence. Any amino acid substitution, insertion, or deletion is encompassed
by the invention. In favorable circumstances, the substitution is a conservative substitution as
defined above.

One aspect of the invention pertains to isolated NOVX proteins, and biologically-active
portions thereof, or derivatives, fragments, analogs or homologs thereof. Also provided
are polypeptide fragments suitable for use as immunogens to raise anti-NOVX antibodies. In
one embodiment, native NOVX proteins can be isolated from cells or tissue sources by an
appropriate purification scheme using standard protein purification techniques. In another
embodiment, NOVX proteins are produced by recombinant DNA techniques. Alternative to
recombinant expression, an NOVX protein or polypeptide can be synthesized chemically
using standard peptide synthesis techniques.

An "isolated" or "purified" polypeptide or protein or biologically-active portion thereof
is substantially free of cellular material or other contaminating proteins from the cell or tissue
source from which the NOVX protein is derived, or substantially free from chemical
precursors or other chemicals when chemically synthesized. The language "substantially free
of cellular material" includes preparations of NOVX proteins in which the protein is separated
from cellular components of the cells from which it is isolated or recombinantly-produced. In
one embodiment, the language "substantially free of cellular material" includes preparations of
NOVX proteins having less than about 30% (by dry weight) of non-NOVX proteins (also
referred to herein as a "contaminating protein"), more preferably less than about 20% of
non-NOVX proteins, still more preferably less than about 10% of non-NOVX proteins, and
most preferably less than about 5% of non-NOVX proteins. When the NOVX protein or
biologically-active portion thereof is recombinantly-produced, it is also preferably
substantially free of culture medium, i.e., culture medium represents less than about 20%,
more preferably less than about 10%, and most preferably less than about 5% of the volume of
the NOVX protein preparation.

The language "substantially free of chemical precursors or other chemicals" includes
preparations of NOVX proteins in which the protein is separated from chemical precursors or
other chemicals that are involved in the synthesis of the protein. In one embodiment, the
language "substantially free of chemical precursors or other chemicals" includes preparations
of NOVX proteins having less than about 30% (by dry weight) of chemical precursors or
non-NOVX chemicals, more preferably less than about 20% chemical precursors or
non-NOVX chemicals, still more preferably less than about 10% chemical precursors or
non-NOVX chemicals, and most preferably less than about 5% chemical precursors or
non-NOVX chemicals.

Biologically-active portions of NOVX proteins include peptides comprising amino
acid sequences sufficiently homologous to or derived from the amino acid sequences of the
NOVX proteins (e.g., the amino acid sequence shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14,
16, 18, 20, 22, 24, 26, 28, 30, 32 and 34) that include fewer amino acids than the full-length
NOVX proteins, and exhibit at least one activity of an NOVX protein. Typically, biologically-active
portions comprise a domain or motif with at least one activity of the NOVX protein. A
biologically-active portion of an NOVX protein can be a polypeptide which is, for example,
10, 25, 50, 100 or more amino acid residues in length.

Moreover, other biologically-active portions, in which other regions of the protein are
deleted, can be prepared by recombinant techniques and evaluated for one or more of the
functional activities of a native NOVX protein.

In an embodiment, the NOVX protein has an amino acid sequence shown in SEQ ID
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and 34. In other embodiments, the
NOVX protein is substantially homologous to SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20,
22, 24, 26, 28, 30, 32 and 34, and retains the functional activity of the protein of SEQ ID
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and 34, yet differs in amino acid
sequence due to natural allelic variation or mutagenesis, as described in detail, below.
Accordingly, in another embodiment, the NOVX protein is a protein that comprises an amino
acid sequence at least about 45% homologous to the amino acid sequence SEQ ID NOS:2, 4,
6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and 34, and retains the functional activity of
the NOVX proteins of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32
and 34.

Determining Homology Between Two or More Sequences 

To determine the percent homology of two amino acid sequences or of two nucleic
acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced
in the sequence of a first amino acid or nucleic acid sequence for optimal alignment with a
second amino or nucleic acid sequence). The amino acid residues or nucleotides at
corresponding amino acid positions or nucleotide positions are then compared. When a
position in the first sequence is occupied by the same amino acid residue or nucleotide as the
corresponding position in the second sequence, then the molecules are homologous at that
position (i.e., as used herein amino acid or nucleic acid "homology" is equivalent to amino
acid or nucleic acid "identity").

The nucleic acid sequence homology may be determined as the degree of identity
between two sequences. The homology may be determined using computer programs known
in the art, such as GAP software provided in the GCG program package. See, Needleman and
Wunsch, 1970. J Mol Biol 48: 443-453. Using GCG GAP software with the following settings
for nucleic acid sequence comparison: GAP creation penalty of 5.0 and GAP extension
penalty of 0.3, the coding region of the analogous nucleic acid sequences referred to above
exhibits a degree of identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or
99%, with the CDS (encoding) part of the DNA sequence shown in SEQ ID NOS:1, 3, 5, 7, 9,
11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and 33.

The term "sequence identity" refers to the degree to which two polynucleotide or
polypeptide sequences are identical on a residue-by-residue basis over a particular region of
comparison. The term "percentage of sequence identity" is calculated by comparing two
optimally aligned sequences over that region of comparison, determining the number of
positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case of
nucleic acids) occurs in both sequences to yield the number of matched positions, dividing the
number of matched positions by the total number of positions in the region of comparison (i.e.,
the window size), and multiplying the result by 100 to yield the percentage of sequence
identity. The term "substantial identity" as used herein denotes a characteristic of a
polynucleotide sequence, wherein the polynucleotide comprises a sequence that has at least 80
percent sequence identity, preferably at least 85 percent identity and often 90 to 95 percent
sequence identity, more usually at least 99 percent sequence identity as compared to a
reference sequence over a comparison region.
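
The "percentage of sequence identity" calculation described above can be sketched as follows. This is a minimal example under stated assumptions: the two sequences are assumed to be already optimally aligned and of equal length, the character "-" is assumed to denote a gap, and the function name `percent_identity` is hypothetical. It does not perform the alignment itself and is not the GCG GAP program.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of sequence identity over an aligned region of comparison.

    Matched positions are divided by the total number of positions in the
    region (the window size) and multiplied by 100. A column in which both
    sequences carry a gap is not counted as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("region of comparison is empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    identity = percent_identity("ACGTACGTAC", "ACGTTCGTAC")
    print(identity)
    # The text gives at least 80 percent as the floor for "substantial identity".
    print(identity >= 80.0)
```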

Chimeric and Fusion Proteins



The invention also provides NOVX chimeric or fusion proteins. As used herein, an
NOVX "chimeric protein" or "fusion protein" comprises an NOVX polypeptide operatively-linked
to a non-NOVX polypeptide. An "NOVX polypeptide" refers to a polypeptide having
an amino acid sequence corresponding to an NOVX protein of SEQ ID NOS:2, 4, 6, 8, 10, 12,
14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and 34, whereas a "non-NOVX polypeptide" refers to a
polypeptide having an amino acid sequence corresponding to a protein that is not substantially
homologous to the NOVX protein, e.g., a protein that is different from the NOVX protein and
that is derived from the same or a different organism. Within an NOVX fusion protein the
NOVX polypeptide can correspond to all or a portion of an NOVX protein. In one
embodiment, an NOVX fusion protein comprises at least one biologically-active portion of an
NOVX protein. In another embodiment, an NOVX fusion protein comprises at least two
biologically-active portions of an NOVX protein. In yet another embodiment, an NOVX
fusion protein comprises at least three biologically-active portions of an NOVX protein.
Within the fusion protein, the term "operatively-linked" is intended to indicate that the NOVX
polypeptide and the non-NOVX polypeptide are fused in-frame with one another. The
non-NOVX polypeptide can be fused to the N-terminus or C-terminus of the NOVX
polypeptide.
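
To illustrate what fusing two polypeptides "in-frame" means at the level of the coding sequences, the sketch below joins an upstream and a downstream DNA coding sequence so that the reading frame is preserved. It is a simplified example only: the function name `fuse_in_frame` is hypothetical, linkers and cloning sites are ignored, and the standard stop codons TAA, TAG and TGA are assumed.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(upstream_cds, downstream_cds):
    """Join two DNA coding sequences so the downstream partner stays in frame.

    Each input must be a whole number of codons. A terminal stop codon on the
    upstream partner is dropped so that translation can continue into the
    downstream partner; an internal stop codon upstream is rejected.
    """
    up = upstream_cds.upper()
    down = downstream_cds.upper()
    if len(up) % 3 or len(down) % 3:
        raise ValueError("each coding sequence must be a whole number of codons")
    if up[-3:] in STOP_CODONS:
        up = up[:-3]
    codons = [up[i:i + 3] for i in range(0, len(up), 3)]
    if any(codon in STOP_CODONS for codon in codons):
        raise ValueError("internal stop codon in upstream coding sequence")
    return up + down


if __name__ == "__main__":
    # Toy sequences, not taken from any SEQ ID NO in this document.
    print(fuse_in_frame("ATGGCCTAA", "ATGAAATGA"))
```

Placing the NOVX sequence as the downstream argument corresponds to fusing it to the C-terminus of the partner polypeptide; swapping the arguments corresponds to an N-terminal NOVX fusion.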

In one embodiment, the fusion protein is a GST-NOVX fusion protein in which the
NOVX sequences are fused to the C-terminus of the GST (glutathione S-transferase)
sequences. Such fusion proteins can facilitate the purification of recombinant NOVX
polypeptides.

In another embodiment, the fusion protein is an NOVX protein containing a
heterologous signal sequence at its N-terminus. In certain host cells (e.g., mammalian host
cells), expression and/or secretion of NOVX can be increased through use of a heterologous
signal sequence.

In yet another embodiment, the fusion protein is an NOVX-immunoglobulin fusion
protein in which the NOVX sequences are fused to sequences derived from a member of the
immunoglobulin protein family. The NOVX-immunoglobulin fusion proteins of the invention
can be incorporated into pharmaceutical compositions and administered to a subject to inhibit
an interaction between an NOVX ligand and an NOVX protein on the surface of a cell, to
thereby suppress NOVX-mediated signal transduction in vivo. The NOVX-immunoglobulin
fusion proteins can be used to affect the bioavailability of an NOVX cognate ligand.
Inhibition of the NOVX ligand/NOVX interaction may be useful therapeutically for both the
treatment of proliferative and differentiative disorders, as well as modulating (e.g., promoting
or inhibiting) cell survival. Moreover, the NOVX-immunoglobulin fusion proteins of the
invention can be used as immunogens to produce anti-NOVX antibodies in a subject, to purify
NOVX ligands, and in screening assays to identify molecules that inhibit the interaction of
NOVX with an NOVX ligand.

An NOVX chimeric or fusion protein of the invention can be produced by standard
recombinant DNA techniques. For example, DNA fragments coding for the different
polypeptide sequences are ligated together in-frame in accordance with conventional
techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, restriction
enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate,
alkaline phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In
another embodiment, the fusion gene can be synthesized by conventional techniques including
automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can be
carried out using anchor primers that give rise to complementary overhangs between two
consecutive gene fragments that can subsequently be annealed and reamplified to generate a
chimeric gene sequence (see, e.g., Ausubel, et al. (eds.) Current Protocols in Molecular
Biology, John Wiley & Sons, 1992). Moreover, many expression vectors are commercially
available that already encode a fusion moiety (e.g., a GST polypeptide). An NOVX-encoding
nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked
in-frame to the NOVX protein.
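
By way of illustration only, the in-frame requirement described above can be checked
computationally before a fusion construct is assembled. The following Python sketch is a
minimal example under stated assumptions: the two coding sequences are short invented
placeholders (they are not GST or NOVX sequences), and the helper names `codons` and
`fuse_in_frame` are hypothetical and do not appear in the specification.

```python
# Minimal sketch: join two open reading frames so the downstream one stays in-frame.
# All sequences are invented placeholders, not actual GST or NOVX sequences.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons(seq):
    """Split a DNA string into consecutive triplets (a trailing partial codon is dropped)."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def fuse_in_frame(upstream_orf, downstream_orf, linker=""):
    """Return a fusion coding sequence with the upstream stop codon removed.

    Raises ValueError if the junction would shift the downstream reading frame
    or if a stop codon appears before the final codon of the fusion.
    """
    upstream_orf = upstream_orf.upper()
    downstream_orf = downstream_orf.upper()
    linker = linker.upper()
    if len(upstream_orf) % 3 or len(downstream_orf) % 3:
        raise ValueError("each ORF must be a whole number of codons")
    if len(linker) % 3:
        raise ValueError("linker length must be a multiple of 3 to keep the frame")
    up = codons(upstream_orf)
    if up and up[-1] in STOP_CODONS:
        up = up[:-1]  # drop the stop so translation reads through into the partner
    fusion = "".join(up) + linker + downstream_orf
    if any(c in STOP_CODONS for c in codons(fusion)[:-1]):
        raise ValueError("premature stop codon inside the fusion")
    return fusion


tag_orf = "ATGTCCCCTATATAA"     # placeholder N-terminal fusion moiety, ends in a stop codon
target_orf = "ATGGCTGAACGTTGA"  # placeholder target coding sequence
fusion = fuse_in_frame(tag_orf, target_orf, linker="GGATCC")
print(fusion, len(fusion) % 3 == 0)
```

A linker whose length is not a multiple of three is rejected, which mirrors the reason a
fusion moiety must be joined in-frame: a one- or two-nucleotide offset would change every
downstream codon.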

NOVX Agonists and Antagonists

The invention also pertains to variants of the NOVX proteins that function as either
NOVX agonists (i.e., mimetics) or as NOVX antagonists. Variants of the NOVX protein can
be generated by mutagenesis (e.g., discrete point mutation or truncation of the NOVX protein).
An agonist of the NOVX protein can retain substantially the same, or a subset of, the
biological activities of the naturally occurring form of the NOVX protein. An antagonist of
the NOVX protein can inhibit one or more of the activities of the naturally occurring form of
the NOVX protein by, for example, competitively binding to a downstream or upstream
member of a cellular signaling cascade which includes the NOVX protein. Thus, specific
biological effects can be elicited by treatment with a variant of limited function. In one
embodiment, treatment of a subject with a variant having a subset of the biological activities
of the naturally occurring form of the protein has fewer side effects in a subject relative to
treatment with the naturally occurring form of the NOVX proteins.

Variants of the NOVX proteins that function as either NOVX agonists (i.e., mimetics)
or as NOVX antagonists can be identified by screening combinatorial libraries of mutants
(e.g., truncation mutants) of the NOVX proteins for NOVX protein agonist or antagonist
activity. In one embodiment, a variegated library of NOVX variants is generated by
combinatorial mutagenesis at the nucleic acid level and is encoded by a variegated gene
library. A variegated library of NOVX variants can be produced by, for example,
enzymatically ligating a mixture of synthetic oligonucleotides into gene sequences such that a
degenerate set of potential NOVX sequences is expressible as individual polypeptides, or
alternatively, as a set of larger fusion proteins (e.g., for phage display) containing the set of
NOVX sequences therein. There are a variety of methods which can be used to produce
libraries of potential NOVX variants from a degenerate oligonucleotide sequence. Chemical
synthesis of a degenerate gene sequence can be performed in an automatic DNA synthesizer,
and the synthetic gene then ligated into an appropriate expression vector. Use of a degenerate
set of genes allows for the provision, in one mixture, of all of the sequences encoding the
desired set of potential NOVX sequences. Methods for synthesizing degenerate
oligonucleotides are well-known within the art. See, e.g., Narang, 1983. Tetrahedron 39: 3;
Itakura, et al., 1984. Annu. Rev. Biochem. 53: 323; Itakura, et al., 1984. Science 198: 1056;
Ike, et al., 1983. Nucl. Acids Res. 11: 477.
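
As a purely illustrative aside, the way a single degenerate oligonucleotide stands for a
whole mixture of sequences can be sketched in a few lines of Python. The sketch assumes the
standard IUPAC nucleotide ambiguity codes; the example oligonucleotide `GCNRAY` and the
function names `library_size` and `expand` are hypothetical and are not taken from the
specification or the cited references.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def library_size(degenerate_oligo):
    """Number of distinct sequences represented by a degenerate oligonucleotide."""
    size = 1
    for base in degenerate_oligo.upper():
        size *= len(IUPAC[base])
    return size


def expand(degenerate_oligo):
    """List every concrete sequence encoded by the degenerate oligonucleotide."""
    choices = [IUPAC[base] for base in degenerate_oligo.upper()]
    return ["".join(combo) for combo in product(*choices)]


# Hypothetical two-codon example: GCN (four alanine codons) followed by RAY
# (AAC/AAT encode asparagine, GAC/GAT encode aspartate).
oligo = "GCNRAY"
variants = expand(oligo)
print(library_size(oligo), variants[:4])
```

Because the library size is the product of the per-position choices, it grows very quickly
with the number of degenerate positions, which is why the size is computed separately from
the full enumeration.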

Polypeptide Libraries 

Libraries of fragments of the NOVX protein coding sequences can be used
to generate a variegated population of NOVX fragments for screening and subsequent
selection of variants of an NOVX protein. In one embodiment, a library of coding sequence
fragments can be generated by treating a double stranded PCR fragment of an NOVX coding
sequence with a nuclease under conditions wherein nicking occurs only about once per
molecule, denaturing the double stranded DNA, renaturing the DNA to form double-stranded
DNA that can include sense/antisense pairs from different nicked products, removing single
stranded portions from reformed duplexes by treatment with S1 nuclease, and ligating the
resulting fragment library into an expression vector. By this method, expression libraries can
be derived which encode N-terminal and internal fragments of various sizes of the NOVX
proteins.

Various techniques are known in the art for screening gene products of combinatorial
libraries made by point mutations or truncation, and for screening cDNA libraries for gene
products having a selected property. Such techniques are adaptable for rapid screening of the
gene libraries generated by the combinatorial mutagenesis of NOVX proteins. The most
widely used techniques, which are amenable to high throughput analysis, for screening large
gene libraries typically include cloning the gene library into replicable expression vectors,
transforming appropriate cells with the resulting library of vectors, and expressing the
combinatorial genes under conditions in which detection of a desired activity facilitates
isolation of the vector encoding the gene whose product was detected. Recursive ensemble
mutagenesis (REM), a new technique that enhances the frequency of functional mutants in the
libraries, can be used in combination with the screening assays to identify NOVX variants.
See, e.g., Arkin and Youvan, 1992. Proc. Natl. Acad. Sci. USA 89: 7811-7815; Delagrave, et
al., 1993. Protein Engineering 6:321-331.

Anti-NOVX Antibodies

Also included in the invention are antibodies to NOVX proteins, or fragments of
NOVX proteins. The term "antibody" as used herein refers to immunoglobulin molecules and
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that
contain an antigen binding site that specifically binds (immunoreacts with) an antigen. Such
antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, Fab,
Fab' and F(ab')2 fragments, and an Fab expression library. In general, an antibody molecule
obtained from humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ
from one another by the nature of the heavy chain present in the molecule. Certain classes
have subclasses as well, such as IgG1, IgG2, and others. Furthermore, in humans, the light
chain may be a kappa chain or a lambda chain. Reference herein to antibodies includes a
reference to all such classes, subclasses and types of human antibody species.

An isolated NOVX-related protein of the invention may be intended to serve as an
antigen, or a portion or fragment thereof, and additionally can be used as an immunogen to
generate antibodies that immunospecifically bind the antigen, using standard techniques for
polyclonal and monoclonal antibody preparation. The full-length protein can be used or,
alternatively, the invention provides antigenic peptide fragments of the antigen for use as
immunogens. An antigenic peptide fragment comprises at least 6 amino acid residues of the
amino acid sequence of the full length protein and encompasses an epitope thereof such that an
antibody raised against the peptide forms a specific immune complex with the full length
protein or with any fragment that contains the epitope. Preferably, the antigenic peptide
comprises at least 10 amino acid residues, or at least 15 amino acid residues, or at least 20
amino acid residues, or at least 30 amino acid residues. Preferred epitopes encompassed by
the antigenic peptide are regions of the protein that are located on its surface; commonly these
are hydrophilic regions.

In certain embodiments of the invention, at least one epitope encompassed by the
antigenic peptide is a region of NOVX-related protein that is located on the surface of the
protein, e.g., a hydrophilic region. A hydrophobicity analysis of the human NOVX-related
protein sequence will indicate which regions of a NOVX-related protein are particularly
hydrophilic and, therefore, are likely to encode surface residues useful for targeting antibody
production. As a means for targeting antibody production, hydropathy plots showing regions
of hydrophilicity and hydrophobicity may be generated by any method well known in the art,
including, for example, the Kyte Doolittle or the Hopp Woods methods, either with or without
Fourier transformation. See, e.g., Hopp and Woods, 1981, Proc. Nat. Acad. Sci. USA 78:
3824-3828; Kyte and Doolittle 1982, J. Mol. Biol. 157: 105-142, each of which is incorporated
herein by reference in its entirety. Antibodies that are specific for one or more domains within
an antigenic protein, or derivatives, fragments, analogs or homologs thereof, are also provided
herein.
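
For illustration, a sliding-window hydropathy profile of the Kyte Doolittle type can be
sketched as follows. This is a minimal Python example under stated assumptions, not a
prescribed procedure: the per-residue values are the published Kyte Doolittle hydropathy
scale, the window length of 7 is an illustrative choice, the peptide sequence is invented,
and the function names are hypothetical. On this scale, lower (more negative) window
averages indicate more hydrophilic stretches, which are candidate surface regions.

```python
# Kyte Doolittle hydropathy values (positive = hydrophobic, negative = hydrophilic).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq, window=7):
    """Mean hydropathy of every window of the given length along the sequence."""
    seq = seq.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KD[residue] for residue in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]


def most_hydrophilic_window(seq, window=7):
    """Return (start index, subsequence, mean score) of the most hydrophilic window."""
    seq = seq.upper()
    profile = hydropathy_profile(seq, window)
    start = min(range(len(profile)), key=profile.__getitem__)
    return start, seq[start:start + window], profile[start]


# Invented example sequence: hydrophobic flanks around a charged stretch.
peptide = "MLLAVLLIAGDKRKEDESTPLVIVAL"
start, segment, score = most_hydrophilic_window(peptide)
print(start, segment, round(score, 2))
```

The Hopp Woods method follows the same windowed-average idea with a different residue
scale; only the Kyte Doolittle values are shown here.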

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog
thereof may be utilized as an immunogen in the generation of antibodies that
immunospecifically bind these protein components.

Various procedures known within the art may be used for the production of polyclonal
or monoclonal antibodies directed against a protein of the invention, or against derivatives,
fragments, analogs, homologs or orthologs thereof (see, for example, Antibodies: A Laboratory
Manual, Harlow and Lane, 1988, Cold Spring Harbor Laboratory Press, Cold Spring Harbor,
NY, incorporated herein by reference). Some of these antibodies are discussed below.

Polyclonal Antibodies

For the production of polyclonal antibodies, various suitable host animals (e.g., rabbit,
goat, mouse or other mammal) may be immunized by one or more injections with the native
protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate
immunogenic preparation can contain, for example, the naturally occurring immunogenic
protein, a chemically synthesized polypeptide representing the immunogenic protein, or a
recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated
to a second protein known to be immunogenic in the mammal being immunized. Examples of
such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can further
include an adjuvant. Various adjuvants used to increase the immunological response include,
but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., aluminum
hydroxide), surface active substances (e.g., lysolecithin, pluronic polyols, polyanions,
peptides, oil emulsions, dinitrophenol, etc.), adjuvants usable in humans such as Bacille
Calmette-Guerin and Corynebacterium parvum, or similar immunostimulatory agents.
Additional examples of adjuvants which can be employed include MPL-TDM adjuvant
(monophosphoryl Lipid A, synthetic trehalose dicorynomycolate).

The polyclonal antibody molecules directed against the immunogenic protein can be
isolated from the mammal (e.g., from the blood) and further purified by well known
techniques, such as affinity chromatography using protein A or protein G, which provide
primarily the IgG fraction of immune serum. Subsequently, or alternatively, the specific
antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be
immobilized on a column to purify the immune specific antibody by immunoaffinity
chromatography. Purification of immunoglobulins is discussed, for example, by D. Wilkinson
(The Scientist, published by The Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17,
2000), pp. 25-28).

Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as
used herein, refers to a population of antibody molecules that contain only one molecular
species of antibody molecule consisting of a unique light chain gene product and a unique
heavy chain gene product. In particular, the complementarity determining regions (CDRs) of
the monoclonal antibody are identical in all the molecules of the population. MAbs thus
contain an antigen binding site capable of immunoreacting with a particular epitope of the
antigen characterized by a unique binding affinity for it.

Monoclonal antibodies can be prepared using hybridoma methods, such as those
described by Kohler and Milstein, Nature, 256:495 (1975). In a hybridoma method, a mouse,
hamster, or other appropriate host animal, is typically immunized with an immunizing agent to
elicit lymphocytes that produce or are capable of producing antibodies that will specifically
bind to the immunizing agent. Alternatively, the lymphocytes can be immunized in vitro.

The immunizing agent will typically include the protein antigen, a fragment thereof or
a fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of
human origin are desired, or spleen cells or lymph node cells are used if non-human
mammalian sources are desired. The lymphocytes are then fused with an immortalized cell
line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell
(Goding, Monoclonal Antibodies: Principles and Practice, Academic Press, (1986) pp.
59-103). Immortalized cell lines are usually transformed mammalian cells, particularly
myeloma cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines
are employed. The hybridoma cells can be cultured in a suitable culture medium that
preferably contains one or more substances that inhibit the growth or survival of the unfused,
immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas
typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which
substances prevent the growth of HGPRT-deficient cells.

Preferred immortalized cell lines are those that fuse efficiently, support stable high
level expression of antibody by the selected antibody-producing cells, and are sensitive to a
medium such as HAT medium. More preferred immortalized cell lines are murine myeloma
lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, San
Diego, California and the American Type Culture Collection, Manassas, Virginia. Human
myeloma and mouse-human heteromyeloma cell lines also have been described for the
production of human monoclonal antibodies (Kozbor, J. Immunol., 133:3001 (1984); Brodeur
et al., Monoclonal Antibody Production Techniques and Applications, Marcel
Dekker, Inc., New York, (1987) pp. 51-63).

The culture medium in which the hybridoma cells are cultured can then be assayed for
the presence of monoclonal antibodies directed against the antigen. Preferably, the binding
specificity of monoclonal antibodies produced by the hybridoma cells is determined by
immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or
enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in
the art. The binding affinity of the monoclonal antibody can, for example, be determined by
the Scatchard analysis of Munson and Pollard, Anal. Biochem., 107:220 (1980). Preferably,
antibodies having a high degree of specificity and a high binding affinity for the target antigen
are isolated.
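
As an illustrative sketch only, the classical linear Scatchard transform can be written in
a few lines of Python. For a single class of binding sites, B/F = Ka(Bmax - B), so a
straight line fitted to B/F against B has slope -Ka and x-intercept Bmax. This is the
textbook linearization and is not a reproduction of the curve-fitting procedure of the
cited reference; the function name `scatchard_fit`, the concentrations, and the "true"
parameters used to generate the synthetic, noise-free data are all hypothetical.

```python
def scatchard_fit(bound, free):
    """Least-squares line through (B, B/F); returns (Ka, Bmax) for a one-site model."""
    x = list(bound)
    y = [b / f for b, f in zip(bound, free)]
    n = len(x)
    if n < 2:
        raise ValueError("at least two data points are required")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("bound concentrations must not all be equal")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    if slope == 0:
        raise ValueError("zero slope: affinity cannot be estimated")
    ka = -slope              # association constant, 1/M
    bmax = intercept / ka    # x-intercept: total binding-site concentration, M
    return ka, bmax


# Synthetic noise-free data from a one-site model, B = Bmax * F / (Kd + F).
true_kd = 2e-9     # hypothetical dissociation constant (M)
true_bmax = 1e-10  # hypothetical site concentration (M)
free = [0.5e-9, 1e-9, 2e-9, 4e-9, 8e-9]
bound = [true_bmax * f / (true_kd + f) for f in free]

ka, bmax = scatchard_fit(bound, free)
print(f"Ka = {ka:.3e} 1/M, Kd = {1 / ka:.3e} M, Bmax = {bmax:.3e} M")
```

With real, noisy data the linear transform distorts the error structure, so the fitted
values should be treated as rough estimates.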

After the desired hybridoma cells are identified, the clones can be subcloned by
limiting dilution procedures and grown by standard methods. Suitable culture media for this
purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium.
Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal.

The monoclonal antibodies secreted by the subclones can be isolated or purified from
the culture medium or ascites fluid by conventional immunoglobulin purification procedures
such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel
electrophoresis, dialysis, or affinity chromatography.

The monoclonal antibodies can also be made by recombinant DNA methods, such as
those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of
the invention can be readily isolated and sequenced using conventional procedures (e.g., by
using oligonucleotide probes that are capable of binding specifically to genes encoding the
heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as a
preferred source of such DNA. Once isolated, the DNA can be placed into expression vectors,
which are then transfected into host cells such as simian COS cells, Chinese hamster ovary
(CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to
obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA also
can be modified, for example, by substituting the coding sequence for human heavy and light
chain constant domains in place of the homologous murine sequences (U.S. Patent No.
4,816,567; Morrison, Nature 368, 812-13 (1994)) or by covalently joining to the
immunoglobulin coding sequence all or part of the coding sequence for a non-immunoglobulin
polypeptide. Such a non-immunoglobulin polypeptide can be substituted for the constant
domains of an antibody of the invention, or can be substituted for the variable domains of one
antigen-combining site of an antibody of the invention to create a chimeric bivalent antibody.

Humanized Antibodies

The antibodies directed against the protein antigens of the invention can further
comprise humanized antibodies or human antibodies. These antibodies are suitable for
administration to humans without engendering an immune response by the human against the
administered immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins,
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 or other
antigen-binding subsequences of antibodies) that are principally comprised of the sequence of
a human immunoglobulin, and contain minimal sequence derived from a non-human
immunoglobulin. Humanization can be performed following the method of Winter and
co-workers (Jones et al., Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323-327
(1988); Verhoeyen et al., Science, 239:1534-1536 (1988)), by substituting rodent CDRs or CDR
sequences for the corresponding sequences of a human antibody. (See also U.S. Patent No.
5,225,539.) In some instances, Fv framework residues of the human immunoglobulin are
replaced by corresponding non-human residues. Humanized antibodies can also comprise
residues which are found neither in the recipient antibody nor in the imported CDR or
framework sequences. In general, the humanized antibody will comprise substantially all of at
least one, and typically two, variable domains, in which all or substantially all of the CDR
regions correspond to those of a non-human immunoglobulin and all or substantially all of the
framework regions are those of a human immunoglobulin consensus sequence. The humanized
antibody optimally also will comprise at least a portion of an immunoglobulin constant region
(Fc), typically that of a human immunoglobulin (Jones et al., 1986; Riechmann et al., 1988;
and Presta, Curr. Op. Struct. Biol., 2:593-596 (1992)).

Human Antibodies 

Fully human antibodies relate to antibody molecules in which essentially the entire
sequences of both the light chain and the heavy chain, including the CDRs, arise from human
genes. Such antibodies are termed "human antibodies", or "fully human antibodies" herein.
Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell
hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma
technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: MONOCLONAL
ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96). Human monoclonal
antibodies may be utilized in the practice of the present invention and may be produced by
using human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80: 2026-2030) or
by transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et al., 1985 In:
MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96).

In addition, human antibodies can also be produced using additional techniques,
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991);
Marks et al., J. Mol. Biol., 222:581 (1991)). Similarly, human antibodies can be made by
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the
endogenous immunoglobulin genes have been partially or completely inactivated. Upon
challenge, human antibody production is observed, which closely resembles that seen in
humans in all respects, including gene rearrangement, assembly, and antibody repertoire. This
approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825;
5,625,126; 5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10, 779-783 (1992));
Lonberg et al. (Nature 368 856-859 (1994)); Morrison (Nature 368, 812-13 (1994)); Fishwild
et al. (Nature Biotechnology 14, 845-51 (1996)); Neuberger (Nature Biotechnology 14, 826
(1996)); and Lonberg and Huszar (Intern. Rev. Immunol. 13 65-93 (1995)).

Human antibodies may additionally be produced using transgenic nonhuman animals
which are modified so as to produce fully human antibodies rather than the animal's
endogenous antibodies in response to challenge by an antigen. (See PCT publication
WO94/02602). The endogenous genes encoding the heavy and light immunoglobulin chains in
the nonhuman host have been incapacitated, and active loci encoding human heavy and light
chain immunoglobulins are inserted into the host's genome. The human genes are
incorporated, for example, using yeast artificial chromosomes containing the requisite human
DNA segments. An animal which provides all the desired modifications is then obtained as
progeny by crossbreeding intermediate transgenic animals containing fewer than the full
complement of the modifications. The preferred embodiment of such a nonhuman animal is a
mouse, and is termed the Xenomouse™ as disclosed in PCT publications WO 96/33735 and
WO 96/34096. This animal produces B cells which secrete fully human immunoglobulins.
The antibodies can be obtained directly from the animal after immunization with an
immunogen of interest, as, for example, a preparation of a polyclonal antibody, or alternatively
from immortalized B cells derived from the animal, such as hybridomas producing
monoclonal antibodies. Additionally, the genes encoding the immunoglobulins with human
variable regions can be recovered and expressed to obtain the antibodies directly, or can be
further modified to obtain analogs of antibodies such as, for example, single chain Fv
molecules.

An example of a method of producing a nonhuman host, exemplified as a mouse,
lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent
No. 5,939,598. It can be obtained by a method including deleting the J segment genes from at
least one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of
the locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy chain
locus, the deletion being effected by a targeting vector containing a gene encoding a selectable
marker; and producing from the embryonic stem cell a transgenic mouse whose somatic and
germ cells contain the gene encoding the selectable marker.

A method for producing an antibody of interest, such as a human antibody, is disclosed
in U.S. Patent No. 5,916,771. It includes introducing an expression vector that contains a
nucleotide sequence encoding a heavy chain into one mammalian host cell in culture,
introducing an expression vector containing a nucleotide sequence encoding a light chain into
another mammalian host cell, and fusing the two cells to form a hybrid cell. The hybrid cell
expresses an antibody containing the heavy chain and the light chain.

In a further improvement on this procedure, a method for identifying a clinically
relevant epitope on an immunogen, and a correlative method for selecting an antibody that
binds immunospecifically to the relevant epitope with high affinity, are disclosed in PCT
publication WO 99/53049.

Fab Fragments and Single Chain Antibodies

According to the invention, techniques can be adapted for the production of
single-chain antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent
No. 4,946,778). In addition, methods can be adapted for the construction of Fab expression
libraries (see e.g., Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and effective
identification of monoclonal Fab fragments with the desired specificity for a protein or
derivatives, fragments, analogs or homologs thereof. Antibody fragments that contain the
idiotypes to a protein antigen may be produced by techniques known in the art including, but
not limited to: (i) an F(ab')2 fragment produced by pepsin digestion of an antibody molecule; (ii)
an Fab fragment generated by reducing the disulfide bridges of an F(ab')2 fragment; (iii) an Fab
fragment generated by the treatment of the antibody molecule with papain and a reducing
agent and (iv) Fv fragments.

Bispecific Antibodies 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that
have binding specificities for at least two different antigens. In the present case, one of the
binding specificities is for an antigenic protein of the invention. The second binding target is
any other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit.

Methods for making bispecific antibodies are known in the art. Traditionally, the
recombinant production of bispecific antibodies is based on the co-expression of two
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different
specificities (Milstein and Cuello, Nature, 305:537-539 (1983)). Because of the random
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce
a potential mixture of ten different antibody molecules, of which only one has the correct
bispecific structure. The purification of the correct molecule is usually accomplished by
affinity chromatography steps. Similar procedures are disclosed in WO 93/08829, published
13 May 1993, and in Traunecker et al., 1991 EMBO J., 10:3655-3659.
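
The figure of ten different antibody molecules can be reproduced by simple enumeration. The
following Python sketch is illustrative only and assumes unrestricted pairing: each arm of
an antibody is any one of the two heavy chains paired with any one of the two light chains,
and a molecule is an unordered pair of arms. The chain labels H1, H2, L1 and L2, and the
convention that H1/L1 and H2/L2 are the cognate pairs, are hypothetical.

```python
from itertools import combinations_with_replacement, product

heavy_chains = ["H1", "H2"]
light_chains = ["L1", "L2"]

# Each antibody arm is one heavy chain paired with one light chain: 4 possible arms.
arms = list(product(heavy_chains, light_chains))

# An antibody has two arms and their order does not matter, so count unordered pairs.
molecules = list(combinations_with_replacement(arms, 2))


def is_correct_bispecific(molecule):
    """True only when both cognate pairs (H1/L1 and H2/L2) are present, one per arm."""
    return set(molecule) == {("H1", "L1"), ("H2", "L2")}


correct = [m for m in molecules if is_correct_bispecific(m)]
print(len(molecules), len(correct))  # prints: 10 1
```

Four possible arms give 4 molecules with two identical arms plus 6 with two different
arms, for 10 in total, of which exactly one carries both cognate heavy/light pairs.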

Antibody variable domains with the desired binding specificities (antibody-antigen
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part
of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant
region (CH1) containing the site necessary for light-chain binding present in at least one of the
fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the
immunoglobulin light chain, are inserted into separate expression vectors, and are
co-transfected into a suitable host organism. For further details of generating bispecific
antibodies see, for example, Suresh et al., Methods in Enzymology, 121:210 (1986).

According to another approach described in WO 96/27011, the interface between a pair
of antibody molecules can be engineered to maximize the percentage of heterodimers which
are recovered from recombinant cell culture. The preferred interface comprises at least a part
of the CH3 region of an antibody constant domain. In this method, one or more small amino
acid side chains from the interface of the first antibody molecule are replaced with larger side
chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the
large side chain(s) are created on the interface of the second antibody molecule by replacing
large amino acid side chains with smaller ones (e.g. alanine or threonine). This provides a
mechanism for increasing the yield of the heterodimer over other unwanted end-products such
as homodimers.

Bispecific antibodies can be prepared as full length antibodies or antibody fragments
(e.g. F(ab')2 bispecific antibodies). Techniques for generating bispecific antibodies from
antibody fragments have been described in the literature. For example, bispecific antibodies
can be prepared using chemical linkage. Brennan et al., Science 229:81 (1985) describe a
procedure wherein intact antibodies are proteolytically cleaved to generate F(ab')2 fragments.
These fragments are reduced in the presence of the dithiol complexing agent sodium arsenite
to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab'
fragments generated are then converted to thionitrobenzoate (TNB) derivatives. One of the
Fab'-TNB derivatives is then reconverted to the Fab'-thiol by reduction with
mercaptoethylamine and is mixed with an equimolar amount of the other Fab'-TNB derivative
to form the bispecific antibody. The bispecific antibodies produced can be used as agents for
the selective immobilization of enzymes.

Additionally, Fab' fragments can be directly recovered from E. coli and chemically
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992)
describe the production of a fully humanized bispecific antibody F(ab')2 molecule. Each Fab'
fragment was separately secreted from E. coli and subjected to directed chemical coupling in
vitro to form the bispecific antibody. The bispecific antibody thus formed was able to bind to
cells overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic
activity of human cytotoxic lymphocytes against human breast tumor targets.

Various techniques for making and isolating bispecific antibody fragments directly
from recombinant cell culture have also been described. For example, bispecific antibodies
have been produced using leucine zippers. Kostelny et al., J. Immunol. 148(5):1547-1553
(1992). The leucine zipper peptides from the Fos and Jun proteins were linked to the Fab'
portions of two different antibodies by gene fusion. The antibody homodimers were reduced
at the hinge region to form monomers and then re-oxidized to form the antibody heterodimers.
This method can also be utilized for the production of antibody homodimers. The "diabody"
technology described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has
provided an alternative mechanism for making bispecific antibody fragments. The fragments
comprise a heavy-chain variable domain (VH) connected to a light-chain variable domain (VL)
by a linker which is too short to allow pairing between the two domains on the same chain.
Accordingly, the VH and VL domains of one fragment are forced to pair with the
complementary VL and VH domains of another fragment, thereby forming two antigen-binding
sites. Another strategy for making bispecific antibody fragments by the use of single-chain Fv
(sFv) dimers has also been reported. See, Gruber et al., J. Immunol. 152:5368 (1994).

Antibodies with more than two valencies are contemplated. For example, trispecific
antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1991). ... 

Exemplary bispecific antibodies can bind to two different epitopes, at least one of
which originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm
of an immunoglobulin molecule can be combined with an arm which binds to a triggering
molecule on a leukocyte, such as a T-cell receptor molecule (e.g., CD2, CD3, CD28, or B7), or
Fc receptors for IgG (FcγR), such as FcγRI (CD64), FcγRII (CD32) and FcγRIII (CD16), so as
to focus cellular defense mechanisms to the cell expressing the particular antigen. Bispecific
antibodies can also be used to direct cytotoxic agents to cells which express a particular
antigen. These antibodies possess an antigen-binding arm and an arm which binds a cytotoxic
agent or a radionuclide chelator, such as EOTUBE, DTPA, DOTA, or TETA. Another
bispecific antibody of interest binds the protein antigen described herein and further binds
tissue factor (TF).

Heteroconjugate Antibodies

Heteroconjugate antibodies are also within the scope of the present invention.
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such
antibodies have, for example, been proposed to target immune system cells to unwanted cells
(U.S. Patent No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO
92/200373; EP 03089). It is contemplated that the antibodies can be prepared in vitro using
known methods in synthetic protein chemistry, including those involving crosslinking agents.
For example, immunotoxins can be constructed using a disulfide exchange reaction or by
forming a thioether bond. Examples of suitable reagents for this purpose include iminothiolate
and methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No.
4,676,980.

Effector Function Engineering

It can be desirable to modify the antibody of the invention with respect to effector 
function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For 
example, cysteine residue(s) can be introduced into the Fc region, thereby allowing interchain 
disulfide bond formation in this region. The homodimeric antibody thus generated can have 
improved internalization capability and/or increased complement-mediated cell killing and
antibody-dependent cellular cytotoxicity (ADCC). See Caron et al., J. Exp. Med., 176:
1191-1195 (1992) and Shopes, J. Immunol., 148: 2918-2922 (1992). Homodimeric antibodies with
enhanced anti-tumor activity can also be prepared using heterobifunctional cross-linkers as
described in Wolff et al., Cancer Research, 53: 2560-2565 (1993). Alternatively, an antibody
can be engineered that has dual Fc regions and can thereby have enhanced complement lysis
and ADCC capabilities. See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989).

Immunoconjugates

The invention also pertains to immunoconjugates comprising an antibody conjugated
to a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active
toxin of bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive 
isotope (i.e., a radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have
been described above. Enzymatically active toxins and fragments thereof that can be used
include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain
(from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin,
Aleurites fordii proteins, dianthin proteins, Phytolacca americana proteins (PAPI, PAPII, and
PAP-S), Momordica charantia inhibitor, curcin, crotin, Saponaria officinalis inhibitor,
gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of
radionuclides are available for the production of radioconjugated antibodies. Examples 
include 212Bi, 131I, 90Y, and 186Re.

Conjugates of the antibody and cytotoxic agent are made using a variety of
bifunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate
(SPDP), iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl
adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as
glutaraldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine),
bis-diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates
(such as toluene 2,6-diisocyanate), and bis-active fluorine compounds (such as
1,5-difluoro-2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as
described in Vitetta et al., Science 238: 1098 (1987). Carbon-14-labeled
1-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary
chelating agent for conjugation of radionucleotide to the antibody. See WO94/11026.

In another embodiment, the antibody can be conjugated to a "receptor" (such as
streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is
administered to the patient, followed by removal of unbound conjugate from the circulation
using a clearing agent and then administration of a "ligand" (e.g., avidin) that is in turn
conjugated to a cytotoxic agent. 

In one embodiment, methods for the screening of antibodies that possess the desired
specificity include, but are not limited to, enzyme-linked immunosorbent assay (ELISA) and
other immunologically-mediated techniques known within the art. In a specific embodiment,
selection of antibodies that are specific to a particular domain of an NOVX protein is
facilitated by generation of hybridomas that bind to the fragment of an NOVX protein
possessing such a domain. Thus, antibodies that are specific for a desired domain within an
NOVX protein, or derivatives, fragments, analogs or homologs thereof, are also provided
herein.

Anti-NOVX antibodies may be used in methods known within the art relating to the
localization and/or quantitation of an NOVX protein (e.g., for use in measuring levels of the
NOVX protein within appropriate physiological samples, for use in diagnostic methods, for
use in imaging the protein, and the like). In a given embodiment, antibodies for NOVX
proteins, or derivatives, fragments, analogs or homologs thereof, that contain the antibody
derived binding domain, are utilized as pharmacologically-active compounds (hereinafter
"Therapeutics").

An anti-NOVX antibody (e.g., monoclonal antibody) can be used to isolate an NOVX
polypeptide by standard techniques, such as affinity chromatography or immunoprecipitation.
An anti-NOVX antibody can facilitate the purification of natural NOVX polypeptide from
cells and of recombinantly-produced NOVX polypeptide expressed in host cells. Moreover,
an anti-NOVX antibody can be used to detect NOVX protein (e.g., in a cellular lysate or cell
supernatant) in order to evaluate the abundance and pattern of expression of the NOVX
protein. Anti-NOVX antibodies can be used diagnostically to monitor protein levels in tissue
as part of a clinical testing procedure, for example, to determine the efficacy of a given
treatment regimen. Detection can be facilitated by coupling (i.e., physically linking) the
antibody to a detectable substance. Examples of detectable substances include various
enzymes, prosthetic groups, fluorescent materials, luminescent materials, bioluminescent
materials, and radioactive materials. Examples of suitable enzymes include horseradish
peroxidase, alkaline phosphatase, β-galactosidase, or acetylcholinesterase; examples of
suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of
suitable fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate,
rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example
of a luminescent material includes luminol; examples of bioluminescent materials include
luciferase, luciferin, and aequorin; and examples of suitable radioactive material include 125I,
131I, 35S or 3H.

NOVX Recombinant Expression Vectors and Host Cells

Another aspect of the invention pertains to vectors, preferably expression vectors,
containing a nucleic acid encoding an NOVX protein, or derivatives, fragments, analogs or
homologs thereof. As used herein, the term "vector" refers to a nucleic acid molecule capable
of transporting another nucleic acid to which it has been linked. One type of vector is a
"plasmid", which refers to a circular double stranded DNA loop into which additional DNA
segments can be ligated. Another type of vector is a viral vector, wherein additional DNA
segments can be ligated into the viral genome. Certain vectors are capable of autonomous
replication in a host cell into which they are introduced (e.g., bacterial vectors having a
bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g.,
non-episomal mammalian vectors) are integrated into the genome of a host cell upon
introduction into the host cell, and thereby are replicated along with the host genome.
Moreover, certain vectors are capable of directing the expression of genes to which they are
operatively-linked. Such vectors are referred to herein as "expression vectors". In general,
expression vectors of utility in recombinant DNA techniques are often in the form of plasmids.
In the present specification, "plasmid" and "vector" can be used interchangeably as the
plasmid is the most commonly used form of vector. However, the invention is intended to
include such other forms of expression vectors, such as viral vectors (e.g., replication defective
retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent functions.

The recombinant expression vectors of the invention comprise a nucleic acid of the
invention in a form suitable for expression of the nucleic acid in a host cell, which means that
the recombinant expression vectors include one or more regulatory sequences, selected on the
basis of the host cells to be used for expression, that are operatively-linked to the nucleic acid
sequence to be expressed. Within a recombinant expression vector, "operably-linked" is
intended to mean that the nucleotide sequence of interest is linked to the regulatory
sequence(s) in a manner that allows for expression of the nucleotide sequence (e.g., in an in
vitro transcription/translation system or in a host cell when the vector is introduced into the
host cell).

The term "regulatory sequence" is intended to include promoters, enhancers and other
expression control elements (e.g., polyadenylation signals). Such regulatory sequences are
described, for example, in Goeddel, Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, Calif. (1990). Regulatory sequences include
those that direct constitutive expression of a nucleotide sequence in many types of host cell
and those that direct expression of the nucleotide sequence only in certain host cells (e.g.,
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art that the
design of the expression vector can depend on such factors as the choice of the host cell to be
transformed, the level of expression of protein desired, etc. The expression vectors of the
invention can be introduced into host cells to thereby produce proteins or peptides, including
fusion proteins or peptides, encoded by nucleic acids as described herein (e.g., NOVX
proteins, mutant forms of NOVX proteins, fusion proteins, etc.).

The recombinant expression vectors of the invention can be designed for expression of
NOVX proteins in prokaryotic or eukaryotic cells. For example, NOVX proteins can be
expressed in bacterial cells such as Escherichia coli, insect cells (using baculovirus expression
vectors), yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel,
Gene Expression Technology: Methods in Enzymology 185, Academic Press, San
Diego, Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and
translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.

Expression of proteins in prokaryotes is most often carried out in Escherichia coli with
vectors containing constitutive or inducible promoters directing the expression of either fusion
or non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded
therein, usually to the amino terminus of the recombinant protein. Such fusion vectors
typically serve three purposes: (i) to increase expression of recombinant protein; (ii) to
increase the solubility of the recombinant protein; and (iii) to aid in the purification of the
recombinant protein by acting as a ligand in affinity purification. Often, in fusion expression
vectors, a proteolytic cleavage site is introduced at the junction of the fusion moiety and the
recombinant protein to enable separation of the recombinant protein from the fusion moiety
subsequent to purification of the fusion protein. Suitable cleavage enzymes, and their cognate
recognition sequences, include Factor Xa, thrombin and enterokinase. Typical fusion
expression vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson, 1988. Gene 67:
31-40), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, Piscataway,
N.J.) that fuse glutathione S-transferase (GST), maltose E binding protein, or protein A,
respectively, to the target recombinant protein.

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc
(Amann et al., (1988) Gene 69:301-315) and pET 11d (Studier et al., Gene Expression
Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990)
60-89).

One strategy to maximize recombinant protein expression in E. coli is to express the
protein in a host bacterium with an impaired capacity to proteolytically cleave the recombinant
protein. See, e.g., Gottesman, Gene Expression Technology: Methods in Enzymology
185, Academic Press, San Diego, Calif. (1990) 119-128. Another strategy is to alter the
nucleic acid sequence of the nucleic acid to be inserted into an expression vector so that the
individual codons for each amino acid are those preferentially utilized in E. coli (see, e.g.,
Wada, et al., 1992. Nucl. Acids Res. 20: 2111-2118). Such alteration of nucleic acid
sequences of the invention can be carried out by standard DNA synthesis techniques. 
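
By way of non-limiting illustration only, the codon alteration strategy described above can be
sketched in software. The following Python sketch translates an open reading frame with the
standard genetic code and rewrites each codon as a host-preferred synonymous codon, leaving the
encoded amino acid sequence unchanged. The preferred-codon table, the function names and the
example sequence are hypothetical and are assumed solely for illustration; in practice a
measured codon usage table for the chosen host would be substituted.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Hypothetical preferred-codon table (illustrative subset, assumed values).
# Replace with a measured usage table for the intended host.
PREFERRED_CODON = {
    "M": "ATG", "L": "CTG", "A": "GCG", "G": "GGC",
    "K": "AAA", "E": "GAA", "*": "TAA",
}


def translate(dna):
    """Translate an in-frame DNA sequence into one-letter amino acid codes."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna), 3))


def recode(dna, preferred=PREFERRED_CODON):
    """Replace each codon with the preferred synonymous codon, if one is listed."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    recoded = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TO_AA[codon]
        # Keep the original codon when the table has no entry for this residue.
        recoded.append(preferred.get(amino_acid, codon))
    return "".join(recoded)


original = "ATGTTAGCAGGAAAGGAGTGA"  # hypothetical seven-codon reading frame
optimized = recode(original)
print(optimized)
print(translate(original) == translate(optimized))
```

The final comparison checks that recoding is silent at the protein level, which is the property
on which the strategy relies.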

In another embodiment, the NOVX expression vector is a yeast expression vector.
Examples of vectors for expression in yeast Saccharomyces cerevisiae include pYepSec1
(Baldari, et al., 1987. EMBO J. 6: 229-234), pMFa (Kurjan and Herskowitz, 1982. Cell 30:
933-943), pJRY88 (Schultz et al., 1987. Gene 54: 113-123), pYES2 (Invitrogen Corporation,
San Diego, Calif.), and picZ (Invitrogen Corp, San Diego, Calif.).

Alternatively, NOVX can be expressed in insect cells using baculovirus expression 
vectors. Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., 
SF9 cells) include the pAc series (Smith, et al., 1983. Mol. Cell. Biol. 3: 2156-2165) and the
pVL series (Luckow and Summers, 1989. Virology 170: 31-39).

In yet another embodiment, a nucleic acid of the invention is expressed in mammalian
cells using a mammalian expression vector. Examples of mammalian expression vectors
include pCDM8 (Seed, 1987. Nature 329: 840) and pMT2PC (Kaufman, et al., 1987. EMBO
J. 6: 187-195). When used in mammalian cells, the expression vector's control functions are
often provided by viral regulatory elements. For example, commonly used promoters are
derived from polyoma, adenovirus 2, cytomegalovirus, and simian virus 40. For other suitable
expression systems for both prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 17 of
Sambrook, et al., Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring
Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989.

In another embodiment, the recombinant mammalian expression vector is capable of
directing expression of the nucleic acid preferentially in a particular cell type (e.g.,
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific
regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific
promoters include the albumin promoter (liver-specific; Pinkert, et al., 1987. Genes Dev. 1:
268-277), lymphoid-specific promoters (Calame and Eaton, 1988. Adv. Immunol. 43:
235-275), in particular promoters of T cell receptors (Winoto and Baltimore, 1989. EMBO J.
8: 729-733) and immunoglobulins (Banerji, et al., 1983. Cell 33: 729-740; Queen and
Baltimore, 1983. Cell 33: 741-748), neuron-specific promoters (e.g., the neurofilament
promoter; Byrne and Ruddle, 1989. Proc. Natl. Acad. Sci. USA 86: 5473-5477),
pancreas-specific promoters (Edlund, et al., 1985. Science 230: 912-916), and mammary
gland-specific promoters (e.g., milk whey promoter; U.S. Pat. No. 4,873,316 and European
Application Publication No. 264,166). Developmentally-regulated promoters are also
encompassed, e.g., the murine hox promoters (Kessel and Gruss, 1990. Science 249: 374-379)
and the α-fetoprotein promoter (Camper and Tilghman, 1989. Genes Dev. 3: 537-546).

The invention further provides a recombinant expression vector comprising a DNA
molecule of the invention cloned into the expression vector in an antisense orientation. That
is, the DNA molecule is operatively-linked to a regulatory sequence in a manner that allows
for expression (by transcription of the DNA molecule) of an RNA molecule that is antisense to
NOVX mRNA. Regulatory sequences operatively linked to a nucleic acid cloned in the
antisense orientation can be chosen that direct the continuous expression of the antisense RNA
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or regulatory
sequences can be chosen that direct constitutive, tissue specific or cell type specific expression
of antisense RNA. The antisense expression vector can be in the form of a recombinant
plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced under the
control of a high efficiency regulatory region, the activity of which can be determined by the
cell type into which the vector is introduced. For a discussion of the regulation of gene
expression using antisense genes see, e.g., Weintraub, et al., "Antisense RNA as a molecular
tool for genetic analysis," Reviews-Trends in Genetics, Vol. 1(1) 1986.
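
By way of non-limiting illustration only, the relationship between a sense transcript and the
transcript produced from the same insert cloned in the antisense orientation can be sketched
as follows. The Python sketch below derives the antisense RNA as the reverse complement of the
coding strand and confirms that it base-pairs, in antiparallel fashion, with the corresponding
mRNA. The nine-nucleotide sequence and all function names are hypothetical and are assumed
solely for illustration; they do not correspond to any NOVX sequence.

```python
# Translation table for complementing DNA bases.
DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")
# Watson-Crick partners for RNA bases.
RNA_PARTNER = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return dna.upper().translate(DNA_COMPLEMENT)[::-1]


def sense_mrna(coding_strand):
    """mRNA has the same sequence as the coding strand, with U in place of T."""
    return coding_strand.upper().replace("T", "U")


def antisense_rna(coding_strand):
    """Transcript obtained when the insert is cloned in the antisense orientation."""
    return reverse_complement(coding_strand).replace("T", "U")


def pairs_antiparallel(rna_a, rna_b):
    """True if the two RNAs are fully complementary when aligned antiparallel."""
    if len(rna_a) != len(rna_b):
        return False
    return all(RNA_PARTNER[a] == b for a, b in zip(rna_a, reversed(rna_b)))


coding = "ATGGCCAAG"  # hypothetical coding-strand fragment
mrna = sense_mrna(coding)
anti = antisense_rna(coding)
print(mrna, anti, pairs_antiparallel(mrna, anti))
```

Full complementarity of the two transcripts is what permits the antisense RNA to hybridize to
the target mRNA.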

Another aspect of the invention pertains to host cells into which a recombinant
expression vector of the invention has been introduced. The terms "host cell" and
"recombinant host cell" are used interchangeably herein. It is understood that such terms refer
not only to the particular subject cell but also to the progeny or potential progeny of such a
cell. Because certain modifications may occur in succeeding generations due to either
mutation or environmental influences, such progeny may not, in fact, be identical to the parent
cell, but are still included within the scope of the term as used herein.

A host cell can be any prokaryotic or eukaryotic cell. For example, NOVX protein can
be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as
Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to
those skilled in the art.

Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional 
transformation or transfection techniques. As used herein, the terms "transformation" and
"transfection" are intended to refer to a variety of art-recognized techniques for introducing
foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium
chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or
electroporation. Suitable methods for transforming or transfecting host cells can be found in
Sambrook, et al. (Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring
Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989), 
and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the
expression vector and transfection technique used, only a small fraction of cells may integrate
the foreign DNA into their genome. In order to identify and select these integrants, a gene that
encodes a selectable marker (e.g., resistance to antibiotics) is generally introduced into the
host cells along with the gene of interest. Various selectable markers include those that confer
resistance to drugs, such as G418, hygromycin and methotrexate. Nucleic acid encoding a
selectable marker can be introduced into a host cell on the same vector as that encoding
NOVX or can be introduced on a separate vector. Cells stably transfected with the introduced
nucleic acid can be identified by drug selection (e.g., cells that have incorporated the
selectable marker gene will survive, while the other cells die).

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can
be used to produce (i.e., express) NOVX protein. Accordingly, the invention further provides
methods for producing NOVX protein using the host cells of the invention. In one
embodiment, the method comprises culturing the host cell of the invention (into which a
recombinant expression vector encoding NOVX protein has been introduced) in a suitable
medium such that NOVX protein is produced. In another embodiment, the method further
comprises isolating NOVX protein from the medium or the host cell.

Transgenic NOVX Animals 

The host cells of the invention can also be used to produce non-human transgenic
animals. For example, in one embodiment, a host cell of the invention is a fertilized oocyte or
an embryonic stem cell into which NOVX protein-coding sequences have been introduced.
Such host cells can then be used to create non-human transgenic animals in which exogenous
NOVX sequences have been introduced into their genome or homologous recombinant
animals in which endogenous NOVX sequences have been altered. Such animals are useful
for studying the function and/or activity of NOVX protein and for identifying and/or
evaluating modulators of NOVX protein activity. As used herein, a "transgenic animal" is a
non-human animal, preferably a mammal, more preferably a rodent such as a rat or mouse, in
which one or more of the cells of the animal includes a transgene. Other examples of
transgenic animals include non-human primates, sheep, dogs, cows, goats, chickens,
amphibians, etc. A transgene is exogenous DNA that is integrated into the genome of a cell
from which a transgenic animal develops and that remains in the genome of the mature
animal, thereby directing the expression of an encoded gene product in one or more cell types
or tissues of the transgenic animal. As used herein, a "homologous recombinant animal" is a
non-human animal, preferably a mammal, more preferably a mouse, in which an endogenous
NOVX gene has been altered by homologous recombination between the endogenous gene
and an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic cell
of the animal, prior to development of the animal.

A transgenic animal of the invention can be created by introducing NOVX-encoding
nucleic acid into the male pronuclei of a fertilized oocyte (e.g., by microinjection, retroviral
infection) and allowing the oocyte to develop in a pseudopregnant female foster animal. The
human NOVX cDNA sequences SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27,
29, 31 and 33 can be introduced as a transgene into the genome of a non-human animal.
Alternatively, a non-human homologue of the human NOVX gene, such as a mouse NOVX
gene, can be isolated based on hybridization to the human NOVX cDNA (described further
supra) and used as a transgene. Intronic sequences and polyadenylation signals can also be
included in the transgene to increase the efficiency of expression of the transgene. A
tissue-specific regulatory sequence(s) can be operably-linked to the NOVX transgene to direct
expression of NOVX protein to particular cells. Methods for generating transgenic animals
via embryo manipulation and microinjection, particularly animals such as mice, have become
conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866;
4,870,009; and 4,873,191; and Hogan, 1986. In: Manipulating the Mouse Embryo, Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. Similar methods are used for
production of other transgenic animals. A transgenic founder animal can be identified based
upon the presence of the NOVX transgene in its genome and/or expression of NOVX mRNA
in tissues or cells of the animals. A transgenic founder animal can then be used to breed
additional animals carrying the transgene. Moreover, transgenic animals carrying a
transgene-encoding NOVX protein can further be bred to other transgenic animals carrying
other transgenes.

To create a homologous recombinant animal, a vector is prepared which contains at
least a portion of an NOVX gene into which a deletion, addition or substitution has been
introduced to thereby alter, e.g., functionally disrupt, the NOVX gene. The NOVX gene can
be a human gene (e.g., the cDNA of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25,
27, 29, 31 and 33), but more preferably, is a non-human homologue of a human NOVX gene.
For example, a mouse homologue of human NOVX gene of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13,
15, 17, 19, 21, 23, 25, 27, 29, 31 and 33 can be used to construct a homologous recombination
vector suitable for altering an endogenous NOVX gene in the mouse genome. In one
embodiment, the vector is designed such that, upon homologous recombination, the
endogenous NOVX gene is functionally disrupted (i.e., no longer encodes a functional protein;
also referred to as a "knock out" vector).

Alternatively, the vector can be designed such that, upon homologous recombination,
the endogenous NOVX gene is mutated or otherwise altered but still encodes functional
protein (e.g., the upstream regulatory region can be altered to thereby alter the expression of
the endogenous NOVX protein). In the homologous recombination vector, the altered portion
of the NOVX gene is flanked at its 5'- and 3'-termini by additional nucleic acid of the NOVX
gene to allow for homologous recombination to occur between the exogenous NOVX gene
carried by the vector and an endogenous NOVX gene in an embryonic stem cell. The
additional flanking NOVX nucleic acid is of sufficient length for successful homologous
recombination with the endogenous gene. Typically, several kilobases of flanking DNA (both
at the 5'- and 3'-termini) are included in the vector. See, e.g., Thomas, et al., 1987. Cell 51:
503 for a description of homologous recombination vectors. The vector is then introduced into
an embryonic stem cell line (e.g., by electroporation) and cells in which the introduced NOVX
gene has homologously-recombined with the endogenous NOVX gene are selected. See, e.g.,
Li, et al., 1992. Cell 69: 915.

The selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to
form aggregation chimeras. See, e.g., Bradley, 1987. In: Teratocarcinomas and
Embryonic Stem Cells: A Practical Approach, Robertson, ed. IRL, Oxford, pp. 113-152.
A chimeric embryo can then be implanted into a suitable pseudopregnant female foster animal
and the embryo brought to term. Progeny harboring the homologously-recombined DNA in
their germ cells can be used to breed animals in which all cells of the animal contain the
homologously-recombined DNA by germline transmission of the transgene. Methods for
constructing homologous recombination vectors and homologous recombinant animals are
described further in Bradley, 1991. Curr. Opin. Biotechnol. 2: 823-829; PCT International
Publication Nos.: WO 90/11354; WO 91/01140; WO 92/0968; and WO 93/04169.

In another embodiment, transgenic non-human animals can be produced that contain
selected systems that allow for regulated expression of the transgene. One example of such a
system is the cre/loxP recombinase system of bacteriophage P1. For a description of the
cre/loxP recombinase system, see, e.g., Lakso, et al., 1992. Proc. Natl. Acad. Sci. USA 89:
6232-6236. Another example of a recombinase system is the FLP recombinase system of
Saccharomyces cerevisiae. See O'Gorman, et al., 1991. Science 251:1351-1355. If a cre/loxP
recombinase system is used to regulate expression of the transgene, animals containing
transgenes encoding both the Cre recombinase and a selected protein are required. Such
animals can be provided through the construction of "double" transgenic animals, e.g., by
mating two transgenic animals, one containing a transgene encoding a selected protein and the
other containing a transgene encoding a recombinase.
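
By way of non-limiting illustration only, the expected yield of "double" transgenic offspring
from such a mating can be estimated with simple Mendelian arithmetic. The Python sketch below
assumes, solely for illustration, that each parent carries its transgene at a single autosomal
locus, that the two loci assort independently, and that neither transgene affects viability.
The function names and copy-number encoding are hypothetical.

```python
from fractions import Fraction


def transmission_probability(copies):
    """Chance that a parent passes a single-locus transgene to a given offspring.

    copies: 0 (non-carrier), 1 (hemizygous) or 2 (homozygous).
    """
    if copies not in (0, 1, 2):
        raise ValueError("copies must be 0, 1 or 2")
    return Fraction(copies, 2)


def double_transgenic_fraction(recombinase_copies, target_copies):
    """Expected fraction of offspring inheriting both transgenes.

    One parent carries only the recombinase transgene and the other carries
    only the selected-protein transgene; inheritance of the two transgenes
    is assumed to be independent.
    """
    return (transmission_probability(recombinase_copies)
            * transmission_probability(target_copies))


# Two hemizygous founders: one quarter of offspring carry both transgenes.
print(double_transgenic_fraction(1, 1))
# A homozygous recombinase line crossed to a hemizygous target line.
print(double_transgenic_fraction(2, 1))
```

Under these assumptions, breeding each line to homozygosity before the cross increases the
proportion of double transgenic offspring.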

Clones of the non-human transgenic animals described herein can also be produced
according to the methods described in Wilmut, et al., 1997. Nature 385: 810-813. In brief, a
cell (e.g., a somatic cell) from the transgenic animal can be isolated and induced to exit the
growth cycle and enter G0 phase. The quiescent cell can then be fused, e.g., through the use of
electrical pulses, to an enucleated oocyte from an animal of the same species from which the
quiescent cell is isolated. The reconstructed oocyte is then cultured such that it develops to
morula or blastocyte and then transferred to a pseudopregnant female foster animal. The
offspring borne of this female foster animal will be a clone of the animal from which the cell
(e.g., the somatic cell) is isolated.

The NOVX nucleic acid molecules, NOVX proteins, and anti-NOVX antibodies (also
referred to herein as "active compounds") of the invention, and derivatives, fragments, analogs
and homologs thereof, can be incorporated into pharmaceutical compositions suitable for
administration. Such compositions typically comprise the nucleic acid molecule, protein or
antibody and a pharmaceutically acceptable carrier. As used herein, "pharmaceutically
acceptable carrier" is intended to include any and all solvents, dispersion media, coatings,
antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like,
compatible with pharmaceutical administration. Suitable carriers are described in the most
recent edition of Remington's Pharmaceutical Sciences, a standard reference text in the field,
which is incorporated herein by reference. Preferred examples of such carriers or diluents
include, but are not limited to, water, saline, Ringer's solutions, dextrose solution, and 5%
human serum albumin. Liposomes and non-aqueous vehicles such as fixed oils may also be
used. The use of such media and agents for pharmaceutically active substances is well known
in the art. Except insofar as any conventional media or agent is incompatible with the active
compound, use thereof in the compositions is contemplated. Supplementary active
compounds can also be incorporated into the compositions.

A pharmaceutical composition of the invention is formulated to be compatible with its
intended route of administration. Examples of routes of administration include parenteral,
e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal (i.e., topical),
transmucosal, and rectal administration. Solutions or suspensions used for parenteral,
intradermal, or subcutaneous application can include the following components: a sterile
diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine,
propylene glycol or other synthetic solvents; antibacterial agents such as benzyl alcohol or
methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such
as ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates;
and agents for the adjustment of tonicity such as sodium chloride or dextrose. The pH can be
adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral
preparation can be enclosed in ampoules, disposable syringes or multiple dose vials made of
glass or plastic.

Pharmaceutical compositions suitable for injectable use include sterile aqueous
solutions (where water soluble) or dispersions and sterile powders for the extemporaneous
preparation of sterile injectable solutions or dispersion. For intravenous administration,
suitable carriers include physiological saline, bacteriostatic water, Cremophor EL™ (BASF,
Parsippany, N.J.) or phosphate buffered saline (PBS). In all cases, the composition must be
sterile and should be fluid to the extent that easy syringeability exists. It must be stable under
the conditions of manufacture and storage and must be preserved against the contaminating
action of microorganisms such as bacteria and fungi. The carrier can be a solvent or
dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol,
propylene glycol, and liquid polyethylene glycol, and the like), and suitable mixtures thereof.
The proper fluidity can be maintained, for example, by the use of a coating such as lecithin, by
the maintenance of the required particle size in the case of dispersion and by the use of
surfactants. Prevention of the action of microorganisms can be achieved by various
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic
acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents,
for example, sugars, polyalcohols such as mannitol, sorbitol, sodium chloride in the
composition. Prolonged absorption of the injectable compositions can be brought about by
including in the composition an agent which delays absorption, for example, aluminum
monostearate and gelatin.

Sterile injectable solutions can be prepared by incorporating the active compound (e.g.,
an NOVX protein or anti-NOVX antibody) in the required amount in an appropriate solvent
with one or a combination of ingredients enumerated above, as required, followed by filtered
sterilization. Generally, dispersions are prepared by incorporating the active compound into a
sterile vehicle that contains a basic dispersion medium and the required other ingredients from
those enumerated above. In the case of sterile powders for the preparation of sterile injectable
solutions, methods of preparation are vacuum drying and freeze-drying that yields a powder of
the active ingredient plus any additional desired ingredient from a previously sterile-filtered
solution thereof.

Oral compositions generally include an inert diluent or an edible carrier. They can be
enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic
administration, the active compound can be incorporated with excipients and used in the form
of tablets, troches, or capsules. Oral compositions can also be prepared using a fluid carrier
for use as a mouthwash, wherein the compound in the fluid carrier is applied orally and
swished and expectorated or swallowed. Pharmaceutically compatible binding agents, and/or
adjuvant materials can be included as part of the composition. The tablets, pills, capsules,
troches and the like can contain any of the following ingredients, or compounds of a similar
nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient
such as starch or lactose; a disintegrating agent such as alginic acid, Primogel, or corn starch; a
lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint,
methyl salicylate, or orange flavoring.

For administration by inhalation, the compounds are delivered in the form of an
aerosol spray from a pressured container or dispenser which contains a suitable propellant, e.g.,
a gas such as carbon dioxide, or a nebulizer.

Systemic administration can also be by transmucosal or transdermal means. For
transmucosal or transdermal administration, penetrants appropriate to the barrier to be
permeated are used in the formulation. Such penetrants include, for example, for
transmucosal administration, detergents, bile salts, and fusidic acid
derivatives. Transmucosal administration can be accomplished through the use of nasal sprays
or suppositories. For transdermal administration, the active compounds are formulated into
ointments, salves, gels, or creams as generally known in the art.

The compounds can also be prepared in the form of suppositories (e.g., with
conventional suppository bases such as cocoa butter and other glycerides) or retention enemas
for rectal delivery.

In one embodiment, the active compounds are prepared with carriers that will protect
the compound against rapid elimination from the body, such as a controlled release
formulation, including implants and microencapsulated delivery systems. Biodegradable,
biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides,
polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation of
such formulations will be apparent to those skilled in the art. The materials can also be
obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal
suspensions (including liposomes targeted to infected cells with monoclonal antibodies to viral
antigens) can also be used as pharmaceutically acceptable carriers. These can be prepared
according to methods known to those skilled in the art, for example, as described in U.S.
Patent No. 4,522,811.

It is especially advantageous to formulate oral or parenteral compositions in dosage
unit form for ease of administration and uniformity of dosage. Dosage unit form as used
herein refers to physically discrete units suited as unitary dosages for the subject to be treated;
each unit containing a predetermined quantity of active compound calculated to produce the
desired therapeutic effect in association with the required pharmaceutical carrier. The
specifications for the dosage unit forms of the invention are dictated by and directly dependent
on the unique characteristics of the active compound and the particular therapeutic effect to be
achieved, and the limitations inherent in the art of compounding such an active compound for
the treatment of individuals.

The nucleic acid molecules of the invention can be inserted into vectors and used as
gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for example,
intravenous injection, local administration (see, e.g., U.S. Patent No. 5,328,470) or by
stereotactic injection (see, e.g., Chen, et al., 1994. Proc. Natl. Acad. Sci. USA 91: 3054-3057).
The pharmaceutical preparation of the gene therapy vector can include the gene therapy vector
in an acceptable diluent, or can comprise a slow release matrix in which the gene delivery
vehicle is imbedded. Alternatively, where the complete gene delivery vector can be produced
intact from recombinant cells, e.g., retroviral vectors, the pharmaceutical preparation can
include one or more cells that produce the gene delivery system.

The pharmaceutical compositions can be included in a container, pack, or dispenser
together with instructions for administration.

Screening and Detection Methods 

The isolated nucleic acid molecules of the invention can be used to express NOVX
protein (e.g., via a recombinant expression vector in a host cell in gene therapy applications),
to detect NOVX mRNA (e.g., in a biological sample) or a genetic lesion in an NOVX gene,
and to modulate NOVX activity, as described further, below. In addition, the NOVX proteins
can be used to screen drugs or compounds that modulate the NOVX protein activity or
expression as well as to treat disorders characterized by insufficient or excessive production of
NOVX protein or production of NOVX protein forms that have decreased or aberrant activity
compared to NOVX wild-type protein (e.g., diabetes (regulates insulin release); obesity (binds
and transports lipids); metabolic disturbances associated with obesity, the metabolic syndrome
X as well as anorexia and wasting disorders associated with chronic diseases and various
cancers, and infectious disease (possesses anti-microbial activity) and the various
dyslipidemias). In addition, the anti-NOVX antibodies of the invention can be used to detect
and isolate NOVX proteins and modulate NOVX activity. In yet a further aspect, the invention
can be used in methods to influence appetite, absorption of nutrients and the disposition of
metabolic substrates in both a positive and negative fashion.

The invention further pertains to novel agents identified by the screening assays
described herein and uses thereof for treatments as described.

Screening Assays

The invention provides a method (also referred to herein as a "screening assay") for
identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides,
peptidomimetics, small molecules or other drugs) that bind to NOVX proteins or have a
stimulatory or inhibitory effect on, e.g., NOVX protein expression or NOVX protein activity.
The invention also includes compounds identified in the screening assays described herein.

In one embodiment, the invention provides assays for screening candidate or test
compounds which bind to or modulate the activity of the membrane-bound form of an NOVX
protein or polypeptide or biologically-active portion thereof. The test compounds of the
invention can be obtained using any of the numerous approaches in combinatorial library
methods known in the art, including: biological libraries; spatially addressable parallel solid
phase or solution phase libraries; synthetic library methods requiring deconvolution; the
"one-bead one-compound" library method; and synthetic library methods using affinity
chromatography selection. The biological library approach is limited to peptide libraries,
while the other four approaches are applicable to peptide, non-peptide oligomer or small
molecule libraries of compounds. See, e.g., Lam, 1997. Anticancer Drug Des.

A "small molecule" as used herein, is meant to refer to a composition that has a 
molecular weight of less than about 5 kD and most preferably less than about 4 kD. Small 
molecules can be, e.g., nucleic acids, peptides, polypeptides, peptidomimetics, carbohydrates, 
lipids or other organic or inorganic molecules. Libraries of chemical and/or biological 
mixtures, such as fungal, bacterial, or algal extracts, are known in the art and can be screened
with any of the assays of the invention. 

Examples of methods for the synthesis of molecular libraries can be found in the art,
for example in: DeWitt, et al., 1993. Proc. Natl. Acad. Sci. U.S.A. 90: 6909; Erb, et al., 1994.
Proc. Natl. Acad. Sci. U.S.A. 91: 11422; Zuckermann, et al., 1994. J. Med. Chem. 37: 2678;
Cho, et al., 1993. Science 261: 1303; Carrell, et al., 1994. Angew. Chem. Int. Ed. Engl. 33:
2059; Carell, et al., 1994. Angew. Chem. Int. Ed. Engl. 33: 2061; and Gallop, et al., 1994. J.
Med. Chem. 37: 1233.

Libraries of compounds may be presented in solution (e.g., Houghten, 1992.
Biotechniques 13: 412-421), or on beads (Lam, 1991. Nature 354: 82-84), on chips (Fodor,
1993. Nature 364: 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), spores (Ladner,
U.S. Patent 5,233,409), plasmids (Cull, et al., 1992. Proc. Natl. Acad. Sci. USA 89:
1865-1869) or on phage (Scott and Smith, 1990. Science 249: 386-390; Devlin, 1990. Science
249: 404-406; Cwirla, et al., 1990. Proc. Natl. Acad. Sci. U.S.A. 87: 6378-6382; Felici, 1991.
J. Mol. Biol. 222: 301-310; Ladner, U.S. Patent No. 5,233,409).

In one embodiment, an assay is a cell-based assay in which a cell which expresses a
membrane-bound form of NOVX protein, or a biologically-active portion thereof, on the cell
surface is contacted with a test compound and the ability of the test compound to bind to an
NOVX protein determined. The cell, for example, can be of mammalian origin or a yeast cell.
Determining the ability of the test compound to bind to the NOVX protein can be
accomplished, for example, by coupling the test compound with a radioisotope or enzymatic
label such that binding of the test compound to the NOVX protein or biologically-active
portion thereof can be determined by detecting the labeled compound in a complex. For
example, test compounds can be labeled with ¹²⁵I, ³⁵S, ¹⁴C, or ³H, either directly or indirectly,
and the radioisotope detected by direct counting of radioemission or by scintillation counting.
Alternatively, test compounds can be enzymatically-labeled with, for example, horseradish
peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label detected by
determination of conversion of an appropriate substrate to product. In one embodiment, the
assay comprises contacting a cell which expresses a membrane-bound form of NOVX protein,
or a biologically-active portion thereof, on the cell surface with a known compound which
binds NOVX to form an assay mixture, contacting the assay mixture with a test compound,
and determining the ability of the test compound to interact with an NOVX protein, wherein
determining the ability of the test compound to interact with an NOVX protein comprises
determining the ability of the test compound to preferentially bind to NOVX protein or a
biologically-active portion thereof as compared to the known compound.

In another embodiment, an assay is a cell-based assay comprising contacting a cell
expressing a membrane-bound form of NOVX protein, or a biologically-active portion thereof,
on the cell surface with a test compound and determining the ability of the test compound to
modulate (e.g., stimulate or inhibit) the activity of the NOVX protein or biologically-active
portion thereof. Determining the ability of the test compound to modulate the activity of
NOVX or a biologically-active portion thereof can be accomplished, for example, by
determining the ability of the NOVX protein to bind to or interact with an NOVX target
molecule. As used herein, a "target molecule" is a molecule with which an NOVX protein
binds or interacts in nature, for example, a molecule on the surface of a cell which expresses
an NOVX interacting protein, a molecule on the surface of a second cell, a molecule in the
extracellular milieu, a molecule associated with the internal surface of a cell membrane or a
cytoplasmic molecule. An NOVX target molecule can be a non-NOVX molecule or an
NOVX protein or polypeptide of the invention. In one embodiment, an NOVX target
molecule is a component of a signal transduction pathway that facilitates transduction of an
extracellular signal (e.g., a signal generated by binding of a compound to a membrane-bound
NOVX molecule) through the cell membrane and into the cell. The target, for example, can be
a second intercellular protein that has catalytic activity or a protein that facilitates the
association of downstream signaling molecules with NOVX.

Determining the ability of the NOVX protein to bind to or interact with an NOVX
target molecule can be accomplished by one of the methods described above for determining
direct binding. In one embodiment, determining the ability of the NOVX protein to bind to or
interact with an NOVX target molecule can be accomplished by determining the activity of the
target molecule. For example, the activity of the target molecule can be determined by
detecting induction of a cellular second messenger of the target (i.e., intracellular Ca²⁺,
diacylglycerol, IP3, etc.), detecting catalytic/enzymatic activity of the target on an appropriate
substrate, detecting the induction of a reporter gene (comprising an NOVX-responsive
regulatory element operatively linked to a nucleic acid encoding a detectable marker, e.g.,
luciferase), or detecting a cellular response, for example, cell survival, cellular differentiation,
or cell proliferation.

In yet another embodiment, an assay of the invention is a cell-free assay comprising
contacting an NOVX protein or biologically-active portion thereof with a test compound and
determining the ability of the test compound to bind to the NOVX protein or biologically-active
portion thereof. Binding of the test compound to the NOVX protein can be determined
either directly or indirectly as described above. In one such embodiment, the assay comprises
contacting the NOVX protein or biologically-active portion thereof with a known compound
which binds NOVX to form an assay mixture, contacting the assay mixture with a test
compound, and determining the ability of the test compound to interact with an NOVX
protein, wherein determining the ability of the test compound to interact with an NOVX
protein comprises determining the ability of the test compound to preferentially bind to NOVX
or biologically-active portion thereof as compared to the known compound.

In still another embodiment, an assay is a cell-free assay comprising contacting NOVX
protein or biologically-active portion thereof with a test compound and determining the ability
of the test compound to modulate (e.g., stimulate or inhibit) the activity of the NOVX protein
or biologically-active portion thereof. Determining the ability of the test compound to
modulate the activity of NOVX can be accomplished, for example, by determining the ability
of the NOVX protein to bind to an NOVX target molecule by one of the methods described
above for determining direct binding. In an alternative embodiment, determining the ability of
the test compound to modulate the activity of NOVX protein can be accomplished by
determining the ability of the NOVX protein to further modulate an NOVX target molecule. For
example, the catalytic/enzymatic activity of the target molecule on an appropriate substrate
can be determined as described, supra.

In yet another embodiment, the cell-free assay comprises contacting the NOVX protein
or biologically-active portion thereof with a known compound which binds NOVX protein to
form an assay mixture, contacting the assay mixture with a test compound, and determining
the ability of the test compound to interact with an NOVX protein, wherein determining the
ability of the test compound to interact with an NOVX protein comprises determining the
ability of the NOVX protein to preferentially bind to or modulate the activity of an NOVX
target molecule.

The cell-free assays of the invention are amenable to use of both the soluble form or
the membrane-bound form of NOVX protein. In the case of cell-free assays comprising the
membrane-bound form of NOVX protein, it may be desirable to utilize a solubilizing agent
such that the membrane-bound form of NOVX protein is maintained in solution. Examples of
such solubilizing agents include non-ionic detergents such as n-octylglucoside,
n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide,
decanoyl-N-methylglucamide, Triton® X-100, Triton® X-114, Thesit®,
Isotridecypoly(ethylene glycol ether)n, N-dodecyl-N,N-dimethyl-3-ammonio-1-propane
sulfonate, 3-(3-cholamidopropyl)dimethylamminiol-1-propane sulfonate (CHAPS), or
3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy-1-propane sulfonate (CHAPSO).

In more than one embodiment of the above assay methods of the invention, it may be
desirable to immobilize either NOVX protein or its target molecule to facilitate separation of
complexed from uncomplexed forms of one or both of the proteins, as well as to accommodate
automation of the assay. Binding of a test compound to NOVX protein, or interaction of
NOVX protein with a target molecule in the presence and absence of a candidate compound,
can be accomplished in any vessel suitable for containing the reactants. Examples of such
vessels include microtiter plates, test tubes, and micro-centrifuge tubes. In one embodiment, a
fusion protein can be provided that adds a domain that allows one or both of the proteins to be
bound to a matrix. For example, GST-NOVX fusion proteins or GST-target fusion proteins
can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or
glutathione derivatized microtiter plates, that are then combined with the test compound or the
test compound and either the non-adsorbed target protein or NOVX protein, and the mixture is
incubated under conditions conducive to complex formation (e.g., at physiological conditions
for salt and pH). Following incubation, the beads or microtiter plate wells are washed to
remove any unbound components, the matrix immobilized in the case of beads, complex
determined either directly or indirectly, for example, as described, supra. Alternatively, the
complexes can be dissociated from the matrix, and the level of NOVX protein binding or
activity determined using standard techniques.

Other techniques for immobilizing proteins on matrices can also be used in the
screening assays of the invention. For example, either the NOVX protein or its target
molecule can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated
NOVX protein or target molecules can be prepared from biotin-NHS
(N-hydroxy-succinimide) using techniques well-known within the art (e.g., biotinylation kit,
Pierce Chemicals, Rockford, Ill.), and immobilized in the wells of streptavidin-coated 96 well
plates (Pierce Chemical). Alternatively, antibodies reactive with NOVX protein or target
molecules, but which do not interfere with binding of the NOVX protein to its target molecule,
can be derivatized to the wells of the plate, and unbound target or NOVX protein trapped in
the wells by antibody conjugation. Methods for detecting such complexes, in addition to those
described above for the GST-immobilized complexes, include immunodetection of complexes
using antibodies reactive with the NOVX protein or target molecule, as well as enzyme-linked
assays that rely on detecting an enzymatic activity associated with the NOVX protein or target
molecule.

In another embodiment, modulators of NOVX protein expression are identified in a
method wherein a cell is contacted with a candidate compound and the expression of NOVX
mRNA or protein in the cell is determined. The level of expression of NOVX mRNA or
protein in the presence of the candidate compound is compared to the level of expression of
NOVX mRNA or protein in the absence of the candidate compound. The candidate
compound can then be identified as a modulator of NOVX mRNA or protein expression based
upon this comparison. For example, when expression of NOVX mRNA or protein is greater
(i.e., statistically significantly greater) in the presence of the candidate compound than in its
absence, the candidate compound is identified as a stimulator of NOVX mRNA or protein
expression. Alternatively, when expression of NOVX mRNA or protein is less (statistically
significantly less) in the presence of the candidate compound than in its absence, the candidate
compound is identified as an inhibitor of NOVX mRNA or protein expression. The level of
NOVX mRNA or protein expression in the cells can be determined by methods described
herein for detecting NOVX mRNA or protein.
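
By way of illustration only, the comparison described above can be sketched in Python as
follows. The function name classify_modulator, the use of an exact permutation test on the
difference of mean expression levels, and the significance threshold of 0.05 are assumptions
made for this sketch; no particular statistical test is prescribed herein.

```python
from itertools import combinations
from statistics import mean


def classify_modulator(with_compound, without_compound, alpha=0.05):
    """Classify a candidate compound from replicate expression measurements.

    with_compound / without_compound: expression levels (e.g., normalized
    mRNA or protein signal) measured in the presence / absence of the
    candidate compound. An exact two-sided permutation test on the
    difference of means is used (an assumption of this sketch).
    Returns (label, p_value).
    """
    observed = mean(with_compound) - mean(without_compound)
    pooled = list(with_compound) + list(without_compound)
    n_with = len(with_compound)
    n_without = len(without_compound)
    total = sum(pooled)

    extreme = 0
    relabelings = 0
    # Enumerate every way of relabeling the pooled measurements.
    for chosen in combinations(range(len(pooled)), n_with):
        group_sum = sum(pooled[i] for i in chosen)
        diff = group_sum / n_with - (total - group_sum) / n_without
        relabelings += 1
        # Small tolerance guards against floating-point rounding.
        if abs(diff) >= abs(observed) - 1e-12:
            extreme += 1

    p_value = extreme / relabelings
    if p_value >= alpha:
        return "no significant effect", p_value
    return ("stimulator" if observed > 0 else "inhibitor"), p_value


if __name__ == "__main__":
    # Hypothetical replicate measurements, for illustration only.
    treated = [9.8, 10.4, 10.1, 9.9, 10.6]
    untreated = [5.1, 4.8, 5.3, 5.0, 4.9]
    print(classify_modulator(treated, untreated))
```

Exhaustive enumeration is practical only for small numbers of replicates; for larger
experiments a sampled permutation test or a parametric test could be substituted.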

In yet another aspect of the invention, the NOVX proteins can be used as "bait
proteins" in a two-hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,317;
Zervos, et al., 1993. Cell 72: 223-232; Madura, et al., 1993. J. Biol. Chem. 268: 12046-12054;
Bartel, et al., 1993. Biotechniques 14: 920-924; Iwabuchi, et al., 1993. Oncogene 8:
1693-1696; and Brent WO 94/10300), to identify other proteins that bind to or interact with
NOVX ("NOVX-binding proteins" or "NOVX-bp") and modulate NOVX activity. Such
NOVX-binding proteins are also likely to be involved in the propagation of signals by the
NOVX proteins as, for example, upstream or downstream elements of the NOVX pathway.

The two-hybrid system is based on the modular nature of most transcription factors,
which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes
two different DNA constructs. In one construct, the gene that codes for NOVX is fused to a
gene encoding the DNA binding domain of a known transcription factor (e.g., GAL-4). In the
other construct, a DNA sequence, from a library of DNA sequences, that encodes an
unidentified protein ("prey" or "sample") is fused to a gene that codes for the activation
domain of the known transcription factor. If the "bait" and the "prey" proteins are able to
interact, in vivo, forming an NOVX-dependent complex, the DNA-binding and activation
domains of the transcription factor are brought into close proximity. This proximity allows
transcription of a reporter gene (e.g., LacZ) that is operably linked to a transcriptional
regulatory site responsive to the transcription factor. Expression of the reporter gene can be
detected and cell colonies containing the functional transcription factor can be isolated and
used to obtain the cloned gene that encodes the protein which interacts with NOVX.

The invention further pertains to novel agents identified by the aforementioned 
screening assays and uses thereof for treatments as described herein.

Detection Assays

Portions or fragments of the cDNA sequences identified herein (and the corresponding
complete gene sequences) can be used in numerous ways as polynucleotide reagents. By way
of example, and not of limitation, these sequences can be used to: (i) map their respective
genes on a chromosome and, thus, locate gene regions associated with genetic disease; (ii)
identify an individual from a minute biological sample (tissue typing); and (iii) aid in forensic
identification of a biological sample. Some of these applications are described in the
subsections, below.

Chromosome Mapping

Once the sequence (or a portion of the sequence) of a gene has been isolated, this
sequence can be used to map the location of the gene on a chromosome. This process is called
chromosome mapping. Accordingly, portions or fragments of the NOVX sequences, SEQ ID
NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and 33, or fragments or derivatives
thereof, can be used to map the location of the NOVX genes, respectively, on a chromosome.
The mapping of the NOVX sequences to chromosomes is an important first step in correlating
these sequences with genes associated with disease.

Briefly, NOVX genes can be mapped to chromosomes by preparing PCR primers
(preferably 15-25 bp in length) from the NOVX sequences. Computer analysis of the NOVX
sequences can be used to rapidly select primers that do not span more than one exon in the
genomic DNA, since primers that span exons would complicate the amplification process.
These primers can then be used for PCR screening of somatic cell hybrids containing
individual human chromosomes. Only those hybrids containing the human gene corresponding
to the NOVX sequences will yield an amplified fragment.
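
The computer-assisted primer selection described above can be sketched, by way of a
non-limiting example, as follows. The function names, the representation of exons as
half-open (start, end) intervals in cDNA coordinates, and the default primer length of 20 bp
are assumptions of this sketch; the exon boundaries themselves would have to be supplied
from a separate alignment of the cDNA to genomic DNA.

```python
def primer_within_single_exon(primer_start, primer_length, exons):
    """Return True if the primer lies entirely inside one exon.

    exons: list of (start, end) half-open intervals in cDNA coordinates
    (a hypothetical representation chosen for this sketch).
    """
    primer_end = primer_start + primer_length
    return any(start <= primer_start and primer_end <= end
               for start, end in exons)


def candidate_primer_starts(exons, primer_length=20):
    """List every start position whose primer does not cross an exon junction."""
    starts = []
    for start, end in exons:
        # An exon shorter than the primer yields an empty range.
        starts.extend(range(start, end - primer_length + 1))
    return starts


if __name__ == "__main__":
    # Hypothetical exon layout, for illustration only.
    exons = [(0, 100), (100, 130), (130, 140)]
    print(primer_within_single_exon(90, 20, exons))  # crosses the junction at 100
    print(len(candidate_primer_starts(exons)))
```

A practical selection would additionally filter candidates on melting temperature, GC content
and uniqueness, none of which is modeled here.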

Somatic cell hybrids are prepared by fusing somatic cells from different mammals
(e.g., human and mouse cells). As hybrids of human and mouse cells grow and divide, they
gradually lose human chromosomes in random order, but retain the mouse chromosomes. By
using media in which mouse cells cannot grow, because they lack a particular enzyme, but in
which human cells can, the one human chromosome that contains the gene encoding the
needed enzyme will be retained. By using various media, panels of hybrid cell lines can be
established. Each cell line in a panel contains either a single human chromosome or a small
number of human chromosomes, and a full set of mouse chromosomes, allowing easy
mapping of individual genes to specific human chromosomes. See, e.g., D'Eustachio, et al.,
1983. Science 220: 919-924. Somatic cell hybrids containing only fragments of human
chromosomes can also be produced by using human chromosomes with translocations and
deletions.

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a particular
sequence to a particular chromosome. Three or more sequences can be assigned per day using
a single thermal cycler. Using the NOVX sequences to design oligonucleotide primers,
sub-localization can be achieved with panels of fragments from specific chromosomes.
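
The assignment of a sequence to a chromosome from a hybrid panel can be sketched, for
illustration only, as a concordance check between the PCR results and the human chromosomes
retained by each hybrid. The function name assign_chromosome, the dictionary layout of the
panel, and the requirement of perfect concordance are assumptions of this sketch.

```python
def assign_chromosome(panel, pcr_positive):
    """Return the human chromosomes whose presence matches the PCR results.

    panel: dict mapping a hybrid cell line name to the set of human
        chromosomes it retains (hypothetical layout for this sketch).
    pcr_positive: set of hybrid names that yielded an amplified fragment.
    A chromosome is reported only if it is present in every positive
    hybrid and absent from every negative hybrid.
    """
    chromosomes = set().union(*panel.values())
    return sorted(
        chrom for chrom in chromosomes
        if all((chrom in retained) == (name in pcr_positive)
               for name, retained in panel.items())
    )


if __name__ == "__main__":
    # Hypothetical four-line panel, for illustration only.
    panel = {
        "H1": {"1", "7"},
        "H2": {"7", "X"},
        "H3": {"1", "X"},
        "H4": {"2"},
    }
    print(assign_chromosome(panel, pcr_positive={"H1", "H2"}))
```

An empty result indicates that no single chromosome explains the observed pattern, which in
practice may reflect an incompletely characterized hybrid or a failed amplification.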

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase
chromosomal spread can further be used to provide a precise chromosomal location in one
step. Chromosome spreads can be made using cells whose division has been blocked in
metaphase by a chemical like colcemid that disrupts the mitotic spindle. The chromosomes
can be treated briefly with trypsin, and then stained with Giemsa. A pattern of light and dark
bands develops on each chromosome, so that the chromosomes can be identified individually.
The FISH technique can be used with a DNA sequence as short as 500 or 600 bases.
However, clones larger than 1,000 bases have a higher likelihood of binding to a unique
chromosomal location with sufficient signal intensity for simple detection. Preferably 1,000
bases, and more preferably 2,000 bases, will suffice to get good results at a reasonable amount
of time. For a review of this technique, see, Verma, et al., Human Chromosomes: A
Manual of Basic Techniques (Pergamon Press, New York 1988).

Reagents for chromosome mapping can be used individually to mark a single
chromosome or a single site on that chromosome, or panels of reagents can be used for
marking multiple sites and/or multiple chromosomes. Reagents corresponding to noncoding
regions of the genes actually are preferred for mapping purposes. Coding sequences are more
likely to be conserved within gene families, thus increasing the chance of cross hybridizations
during chromosomal mapping.

Once a sequence has been mapped to a precise chromosomal location, the physical 
position of the sequence on the chromosome can be correlated with genetic map data. Such 
data are found, e.g., in McKusick, Mendelian Inheritance in Man (available on-line 
through Johns Hopkins University Welch Medical Library). The relationship between genes 
and disease, mapped to the same chromosomal region, can then be identified through linkage 
analysis (co-inheritance of physically adjacent genes), described in, e.g., Egeland, et al., 1987. 
Nature, 325: 783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with the NOVX gene, can be determined. If a mutation is 
observed in some or all of the affected individuals but not in any unaffected individuals, then 
the mutation is likely to be the causative agent of the particular disease. Comparison of 
affected and unaffected individuals generally involves first looking for structural alterations in 
the chromosomes, such as deletions or translocations that are visible from chromosome 
spreads or detectable using PCR based on that DNA sequence. Ultimately, complete 
sequencing of genes from several individuals can be performed to confirm the presence of a 
mutation and to distinguish mutations from polymorphisms. 
Tissue Typing 

The NOVX sequences of the invention can also be used to identify individuals from 
minute biological samples. In this technique, an individual's genomic DNA is digested with 
one or more restriction enzymes, and probed on a Southern blot to yield unique bands for 
identification. The sequences of the invention are useful as additional DNA markers for RFLP 
("restriction fragment length polymorphisms," described in U.S. Patent No. 5,272,057). 

Furthermore, the sequences of the invention can be used to provide an alternative 
technique that determines the actual base-by-base DNA sequence of selected portions of an 
individual's genome. Thus, the NOVX sequences described herein can be used to prepare two 
PCR primers from the 5'- and 3'-termini of the sequences. These primers can then be used to 
amplify an individual's DNA and subsequently sequence it. 
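
The preparation of two primers from the 5'- and 3'-termini of a sequence can be sketched 
as follows. This is a minimal illustration only: the function names, the default primer 
length and the 30-base example string are assumptions made for the sketch (the example 
string is a placeholder, not an NOVX sequence), and practical primer design would also 
weigh melting temperature, GC content and self-complementarity. 

```python
from typing import Tuple


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def terminal_primers(seq: str, primer_len: int = 20) -> Tuple[str, str]:
    """Take a forward primer from the 5' end and a reverse primer from the 3' end."""
    seq = seq.upper()
    if len(seq) < 2 * primer_len:
        raise ValueError("sequence too short for two non-overlapping primers")
    forward = seq[:primer_len]
    # The reverse primer anneals to the given strand, so it is the reverse
    # complement of the last primer_len bases.
    reverse = reverse_complement(seq[-primer_len:])
    return forward, reverse


# Hypothetical 30-base placeholder, not an actual NOVX sequence.
example = "ATGGCCAAGTTTGACCGTAGCTAGGATCCA"
fwd, rev = terminal_primers(example, primer_len=10)
print(fwd, rev)
```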

Panels of corresponding DNA sequences from individuals, prepared in this manner, 
can provide unique individual identifications, as each individual will have a unique set of such 
DNA sequences due to allelic differences. The sequences of the invention can be used to 
obtain such identification sequences from individuals and from tissue. The NOVX sequences 
of the invention uniquely represent portions of the human genome. Allelic variation occurs to 
some degree in the coding regions of these sequences, and to a greater degree in the noncoding 
regions. It is estimated that allelic variation between individual humans occurs with a 
frequency of about once per each 500 bases. Much of the allelic variation is due to single 
nucleotide polymorphisms (SNPs), which include restriction fragment length polymorphisms 
(RFLPs). 

Each of the sequences described herein can, to some degree, be used as a standard 
against which DNA from an individual can be compared for identification purposes. Because 
greater numbers of polymorphisms occur in the noncoding regions, fewer sequences are 
necessary to differentiate individuals. The noncoding sequences can comfortably provide 
positive individual identification with a panel of perhaps 10 to 1,000 primers that each yield a 
noncoding amplified sequence of 100 bases. If predicted coding sequences, such as those in 
SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and 33 are used, a more 
appropriate number of primers for positive individual identification would be 500-2,000. 
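
As a rough worked illustration of the panel sizes above, the expected number of variable 
positions covered by a panel can be estimated from the stated frequency of about one allelic 
variation per 500 bases. The sketch assumes that this rate applies uniformly to every 
amplified region, which is a simplification; the function name and its parameters are 
introduced here for illustration only. 

```python
def expected_polymorphic_sites(n_primers: int, amplicon_len: int = 100,
                               bases_per_variant: int = 500) -> float:
    """Expected number of variable positions across all amplified sequences.

    Assumes one variation per `bases_per_variant` bases, applied uniformly.
    """
    return n_primers * amplicon_len / bases_per_variant


for n in (10, 100, 1000):
    print(n, "primers ->", expected_polymorphic_sites(n), "expected variable sites")
```
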
Predictive Medicine 

The invention also pertains to the field of predictive medicine in which diagnostic 
assays, prognostic assays, pharmacogenomics, and monitoring clinical trials are used for 
prognostic (predictive) purposes to thereby treat an individual prophylactically. Accordingly, 
one aspect of the invention relates to diagnostic assays for determining NOVX protein and/or 
nucleic acid expression as well as NOVX activity, in the context of a biological sample (e.g., 
blood, serum, cells, tissue) to thereby determine whether an individual is afflicted with a 
disease or disorder, or is at risk of developing a disorder, associated with aberrant NOVX 
expression or activity. The disorders include metabolic disorders, diabetes, obesity, infectious 
disease, anorexia, cancer-associated cachexia, cancer, neurodegenerative disorders, 
Alzheimer's Disease, Parkinson's Disorder, immune disorders, and hematopoietic disorders, 
and the various dyslipidemias, metabolic disturbances associated with obesity, the metabolic 
syndrome X and wasting disorders associated with chronic diseases and various cancers. The 
invention also provides for prognostic (or predictive) assays for determining whether an 
individual is at risk of developing a disorder associated with NOVX protein, nucleic acid 
expression or activity. For example, mutations in an NOVX gene can be assayed in a 
biological sample. Such assays can be used for prognostic or predictive purpose to thereby 
prophylactically treat an individual prior to the onset of a disorder characterized by or 
associated with NOVX protein, nucleic acid expression, or biological activity. 

Another aspect of the invention provides methods for determining NOVX protein, 
nucleic acid expression or activity in an individual to thereby select appropriate therapeutic or 
prophylactic agents for that individual (referred to herein as "pharmacogenomics"). 
Pharmacogenomics allows for the selection of agents (e.g., drugs) for therapeutic or 
prophylactic treatment of an individual based on the genotype of the individual (e.g., the 
genotype of the individual examined to determine the ability of the individual to respond to a 
particular agent). 

Yet another aspect of the invention pertains to monitoring the influence of agents (e.g., 
drugs, compounds) on the expression or activity of NOVX in clinical trials. 

These and other agents are described in further detail in the following sections. 
Diagnostic Assays 

An exemplary method for detecting the presence or absence of NOVX in a biological 
sample involves obtaining a biological sample from a test subject and contacting the biological 
sample with a compound or an agent capable of detecting NOVX protein or nucleic acid (e.g., 
mRNA, genomic DNA) that encodes NOVX protein such that the presence of NOVX is 
detected in the biological sample. An agent for detecting NOVX mRNA or genomic DNA is a 
labeled nucleic acid probe capable of hybridizing to NOVX mRNA or genomic DNA. The 
nucleic acid probe can be, for example, a full-length NOVX nucleic acid, such as the nucleic 
acid of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and 33, or a portion 
thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length 
and sufficient to specifically hybridize under stringent conditions to NOVX mRNA or 
genomic DNA. Other suitable probes for use in the diagnostic assays of the invention are 
described herein. 

An agent for detecting NOVX protein is an antibody capable of binding to NOVX 
protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, or more 
preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab')2) can be 
used. The term "labeled", with regard to the probe or antibody, is intended to encompass 
direct labeling of the probe or antibody by coupling (i.e., physically linking) a detectable 
substance to the probe or antibody, as well as indirect labeling of the probe or antibody by 
reactivity with another reagent that is directly labeled. Examples of indirect labeling include 
detection of a primary antibody using a fluorescently-labeled secondary antibody and 
end-labeling of a DNA probe with biotin such that it can be detected with fluorescently-labeled 
streptavidin. The term "biological sample" is intended to include tissues, cells and 
biological fluids isolated from a subject, as well as tissues, cells and fluids present within a 
subject. That is, the detection method of the invention can be used to detect NOVX mRNA, 
protein, or genomic DNA in a biological sample in vitro as well as in vivo. For example, in 
vitro techniques for detection of NOVX mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of NOVX protein include enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations, and 
immunofluorescence. In vitro techniques for detection of NOVX genomic DNA include 
Southern hybridizations. Furthermore, in vivo techniques for detection of NOVX protein 
include introducing into a subject a labeled anti-NOVX antibody. For example, the antibody 
can be labeled with a radioactive marker whose presence and location in a subject can be 
detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the test 
subject. Alternatively, the biological sample can contain mRNA molecules from the test 
subject or genomic DNA molecules from the test subject. A preferred biological sample is a 
peripheral blood leukocyte sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control biological 
sample from a control subject, contacting the control sample with a compound or agent 
capable of detecting NOVX protein, mRNA, or genomic DNA, such that the presence of 
NOVX protein, mRNA or genomic DNA is detected in the biological sample, and comparing 
the presence of NOVX protein, mRNA or genomic DNA in the control sample with the 
presence of NOVX protein, mRNA or genomic DNA in the test sample. 

The invention also encompasses kits for detecting the presence of NOVX in a 
biological sample. For example, the kit can comprise: a labeled compound or agent capable of 
detecting NOVX protein or mRNA in a biological sample; means for determining the amount 
of NOVX in the sample; and means for comparing the amount of NOVX in the sample with a 
standard. The compound or agent can be packaged in a suitable container. The kit can further 
comprise instructions for using the kit to detect NOVX protein or nucleic acid. 
Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant NOVX 
expression or activity. For example, the assays described herein, such as the preceding 
diagnostic assays or the following assays, can be utilized to identify a subject having or at risk 
of developing a disorder associated with NOVX protein, nucleic acid expression or activity. 
Alternatively, the prognostic assays can be utilized to identify a subject having or at risk for 
developing a disease or disorder. Thus, the invention provides a method for identifying a 
disease or disorder associated with aberrant NOVX expression or activity in which a test 
sample is obtained from a subject and NOVX protein or nucleic acid (e.g., mRNA, genomic 
DNA) is detected, wherein the presence of NOVX protein or nucleic acid is diagnostic for a 
subject having or at risk of developing a disease or disorder associated with aberrant NOVX 
expression or activity. As used herein, a "test sample" refers to a biological sample obtained 
from a subject of interest. For example, a test sample can be a biological fluid (e.g., serum), 
cell sample, or tissue. 

Furthermore, the prognostic assays described herein can be used to determine whether 
a subject can be administered an agent (e.g., an agonist, antagonist, peptidomimetic, protein, 
peptide, nucleic acid, small molecule, or other drug candidate) to treat a disease or disorder 
associated with aberrant NOVX expression or activity. For example, such methods can be 
used to determine whether a subject can be effectively treated with an agent for a disorder. 
Thus, the invention provides methods for determining whether a subject can be effectively 
treated with an agent for a disorder associated with aberrant NOVX expression or activity in 
which a test sample is obtained and NOVX protein or nucleic acid is detected (e.g., wherein 
the presence of NOVX protein or nucleic acid is diagnostic for a subject that can be 
administered the agent to treat a disorder associated with aberrant NOVX expression or 
activity). 

The methods of the invention can also be used to detect genetic lesions in an NOVX 
gene, thereby determining if a subject with the lesioned gene is at risk for a disorder 
characterized by aberrant cell proliferation and/or differentiation. In various embodiments, the 
methods include detecting, in a sample of cells from the subject, the presence or absence of a 
genetic lesion characterized by at least one of an alteration affecting the integrity of a gene 
encoding an NOVX protein, or the misexpression of the NOVX gene. For example, such 
genetic lesions can be detected by ascertaining the existence of at least one of: (i) a deletion of 
one or more nucleotides from an NOVX gene; (ii) an addition of one or more nucleotides to an 
NOVX gene; (iii) a substitution of one or more nucleotides of an NOVX gene; (iv) a 
chromosomal rearrangement of an NOVX gene; (v) an alteration in the level of a messenger 
RNA transcript of an NOVX gene; (vi) aberrant modification of an NOVX gene, such as of the 
methylation pattern of the genomic DNA; (vii) the presence of a non-wild-type splicing pattern 
of a messenger RNA transcript of an NOVX gene; (viii) a non-wild-type level of an NOVX 
protein; (ix) allelic loss of an NOVX gene; and (x) inappropriate post-translational 
modification of an NOVX protein. As described herein, there are a large number of assay 
techniques known in the art which can be used for detecting lesions in an NOVX gene. A 
preferred biological sample is a peripheral blood leukocyte sample isolated by conventional 
means from a subject. However, any biological sample containing nucleated cells may be 
used, including, for example, buccal mucosal cells. 

In certain embodiments, detection of the lesion involves the use of a probe/primer in a 
polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and 4,683,202), such 
as anchor PCR or RACE PCR, or, alternatively, in a ligation chain reaction (LCR) (see, e.g., 
Landegran, et al., 1988. Science 241: 1077-1080; and Nakazawa, et al., 1994. Proc. Natl. 
Acad. Sci. USA 91: 360-364), the latter of which can be particularly useful for detecting point 
mutations in the NOVX gene (see, Abravaya, et al., 1995. Nucl. Acids Res. 23: 675-682). 
This method can include the steps of collecting a sample of cells from a patient, isolating 
nucleic acid (e.g., genomic, mRNA or both) from the cells of the sample, contacting the 
nucleic acid sample with one or more primers that specifically hybridize to an NOVX gene 
under conditions such that hybridization and amplification of the NOVX gene (if present) 
occurs, and detecting the presence or absence of an amplification product, or detecting the size 
of the amplification product and comparing the length to a control sample. It is anticipated 
that PCR and/or LCR may be desirable to use as a preliminary amplification step in 
conjunction with any of the techniques used for detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication (see, 
Guatelli, et al., 1990. Proc. Natl. Acad. Sci. USA 87: 1874-1878), transcriptional amplification 
system (see, Kwoh, et al., 1989. Proc. Natl. Acad. Sci. USA 86: 1173-1177); Qβ Replicase 
(see, Lizardi, et al., 1988. BioTechnology 6: 1197), or any other nucleic acid amplification 
method, followed by the detection of the amplified molecules using techniques well known to 
those of skill in the art. These detection schemes are especially useful for the detection of 
nucleic acid molecules if such molecules are present in very low numbers. 

In an alternative embodiment, mutations in an NOVX gene from a sample cell can be 
identified by alterations in restriction enzyme cleavage patterns. For example, sample and 
control DNA is isolated, amplified (optionally), digested with one or more restriction 
endonucleases, and fragment length sizes are determined by gel electrophoresis and compared. 
Differences in fragment length sizes between sample and control DNA indicate mutations in 
the sample DNA. Moreover, sequence specific ribozymes (see, e.g., U.S. Patent 
No. 5,493,531) can be used to score for the presence of specific mutations by development or 
loss of a ribozyme cleavage site. 
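
The comparison of restriction fragment lengths between sample and control DNA can be 
sketched in silico as below. This is an illustration under stated assumptions, not the assay 
itself: the two short sequences are hypothetical placeholders, the DNA is treated as a single 
linear strand, and the enzyme is taken to be EcoRI (recognition site GAATTC, cut after the 
first G). The function name is introduced here for the sketch. 

```python
from typing import List


def fragment_lengths(dna: str, site: str, cut_offset: int) -> List[int]:
    """Return fragment lengths after cutting linear `dna` at every `site`.

    `cut_offset` is the cut position within the recognition site
    (1 for EcoRI, which cuts G^AATTC).
    """
    dna = dna.upper()
    cuts = []
    start = dna.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = dna.find(site, start + 1)
    bounds = [0] + cuts + [len(dna)]
    return [end - begin for begin, end in zip(bounds, bounds[1:])]


# Hypothetical sequences: the sample differs from the control by one base
# that destroys the second EcoRI site (GAATTC -> GAGTTC).
control = "AAAA" + "GAATTC" + "TTTTTT" + "GAATTC" + "CC"
sample = "AAAA" + "GAATTC" + "TTTTTT" + "GAGTTC" + "CC"

control_fragments = fragment_lengths(control, "GAATTC", 1)
sample_fragments = fragment_lengths(sample, "GAATTC", 1)
pattern_differs = sorted(control_fragments) != sorted(sample_fragments)
print(control_fragments, sample_fragments, pattern_differs)
```

A differing fragment pattern only signals that a site was gained or lost; mutations that 
leave every recognition site intact are invisible to this comparison. 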

In other embodiments, genetic mutations in NOVX can be identified by hybridizing a 
sample and control nucleic acids, e.g., DNA or RNA, to high-density arrays containing 
hundreds or thousands of oligonucleotide probes. See, e.g., Cronin, et al., 1996. Human 
Mutation 7: 244-255; Kozal, et al., 1996. Nat. Med. 2: 753-759. For example, genetic 
mutations in NOVX can be identified in two dimensional arrays containing light-generated 
DNA probes as described in Cronin, et al., supra. Briefly, a first hybridization array of probes 
can be used to scan through long stretches of DNA in a sample and control to identify base 
changes between the sequences by making linear arrays of sequential overlapping probes. 
This step allows the identification of point mutations. This is followed by a second 
hybridization array that allows the characterization of specific mutations by using smaller, 
specialized probe arrays complementary to all variants or mutations detected. Each mutation 
array is composed of parallel probe sets, one complementary to the wild-type gene and the 
other complementary to the mutant gene. 
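
The first scanning step, in which sequential overlapping probes locate a base change, can be 
sketched as follows. The model is deliberately crude and rests on assumptions: hybridization 
is reduced to an exact string match at the same offset, the sample is assumed to have the 
same length as the reference, and only a single substitution is considered. The sequences 
and function names are hypothetical and introduced for illustration. 

```python
from typing import Iterator, Optional, Tuple


def tile_probes(reference: str, probe_len: int, step: int = 1) -> Iterator[Tuple[int, str]]:
    """Yield (start, probe) pairs of sequential overlapping probes."""
    for start in range(0, len(reference) - probe_len + 1, step):
        yield start, reference[start:start + probe_len]


def mismatched_region(reference: str, sample: str,
                      probe_len: int) -> Optional[Tuple[int, int]]:
    """Return the (first, last) positions shared by all failing probes, or None."""
    failing = [start for start, probe in tile_probes(reference, probe_len)
               if sample[start:start + probe_len] != probe]
    if not failing:
        return None
    # A single base change disrupts every probe that overlaps it, so the
    # positions common to all failing probes bracket the change.
    return max(failing), min(failing) + probe_len - 1


# Hypothetical reference and a sample with one substitution at position 8.
reference = "ACGTACGTACGTACGT"
sample = "ACGTACGTTCGTACGT"
region = mismatched_region(reference, sample, probe_len=5)
print(region)
```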

In yet another embodiment, any of a variety of sequencing reactions known in the art 
can be used to directly sequence the NOVX gene and detect mutations by comparing the 
sequence of the sample NOVX with the corresponding wild-type (control) sequence. 
Examples of sequencing reactions include those based on techniques developed by Maxam and 
Gilbert, 1977. Proc. Natl. Acad. Sci. USA 74: 560 or Sanger, 1977. Proc. Natl. Acad. Sci. USA 
74: 5463. It is also contemplated that any of a variety of automated sequencing procedures 
can be utilized when performing the diagnostic assays (see, e.g., Naeve, et al., 1995. 
Biotechniques 19: 448), including sequencing by mass spectrometry (see, e.g., PCT 
International Publication No. WO 94/16101; Cohen, et al., 1996. Adv. Chromatography 36: 
127-162; and Griffin, et al., 1993. Appl. Biochem. Biotechnol. 38: 147-159). 

Other methods for detecting mutations in the NOVX gene include methods in which 
protection from cleavage agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA heteroduplexes. See, e.g., Myers, et al., 1985. Science 230: 1242. In general, the 
art technique of "mismatch cleavage" starts by providing heteroduplexes formed by 
hybridizing (labeled) RNA or DNA containing the wild-type NOVX sequence with potentially 
mutant RNA or DNA obtained from a tissue sample. The double-stranded duplexes are 
treated with an agent that cleaves single-stranded regions of the duplex, such as those that 
exist due to basepair mismatches between the control and sample strands. For instance, 
RNA/DNA duplexes can be treated with RNase and DNA/DNA hybrids treated with S1 
nuclease to enzymatically digest the mismatched regions. In other embodiments, either 
DNA/DNA or RNA/DNA duplexes can be treated with hydroxylamine or osmium tetroxide 
and with piperidine in order to digest mismatched regions. After digestion of the mismatched 
regions, the resulting material is then separated by size on denaturing polyacrylamide gels to 
determine the site of mutation. See, e.g., Cotton, et al., 1988. Proc. Natl. Acad. Sci. USA 85: 
4397; Saleeba, et al., 1992. Methods Enzymol. 217: 286-295. In an embodiment, the control 
DNA or RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or more 
proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA 
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations in 
NOVX cDNAs obtained from samples of cells. For example, the mutY enzyme of E. coli 
cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells cleaves T 
at G/T mismatches. See, e.g., Hsu, et al., 1994. Carcinogenesis 15: 1657-1662. According to 
an exemplary embodiment, a probe based on an NOVX sequence, e.g., a wild-type NOVX 
sequence, is hybridized to a cDNA or other DNA product from a test cell(s). The duplex is 
treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be 
detected from electrophoresis protocols or the like. See, e.g., U.S. Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic mobility will be used to identify 
mutations in NOVX genes. For example, single strand conformation polymorphism (SSCP) 
may be used to detect differences in electrophoretic mobility between mutant and wild type 
nucleic acids. See, e.g., Orita, et al., 1989. Proc. Natl. Acad. Sci. USA 86: 2766; Cotton, 
1993. Mutat. Res. 285: 125-144; Hayashi, 1992. Genet. Anal. Tech. Appl. 9: 73-79. 
Single-stranded DNA fragments of sample and control NOVX nucleic acids will be denatured 
and allowed to renature. The secondary structure of single-stranded nucleic acids varies 
according to sequence, and the resulting alteration in electrophoretic mobility enables the 
detection of even a single base change. The DNA fragments may be labeled or detected with 
labeled probes. The sensitivity of the assay may be enhanced by using RNA (rather than DNA), 
in which the secondary structure is more sensitive to a change in sequence. In one embodiment, 
the subject method utilizes heteroduplex analysis to separate double stranded heteroduplex 
molecules on the basis of changes in electrophoretic mobility. See, e.g., Keen, et al., 1991. 
Trends Genet. 7: 5. 

In yet another embodiment, the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing gradient 
gel electrophoresis (DGGE). See, e.g., Myers, et al., 1985. Nature 313: 495. When DGGE is 
used as the method of analysis, DNA will be modified to insure that it does not completely 
denature, for example by adding a GC clamp of approximately 40 bp of high-melting GC-rich 
DNA by PCR. In a further embodiment, a temperature gradient is used in place of a 
denaturing gradient to identify differences in the mobility of control and sample DNA. See, 
e.g., Rosenbaum and Reissner, 1987. Biophys. Chem. 265: 12753. 

Examples of other techniques for detecting point mutations include, but are not limited 
to, selective oligonucleotide hybridization, selective amplification, or selective primer 
extension. For example, oligonucleotide primers may be prepared in which the known 
mutation is placed centrally and then hybridized to target DNA under conditions that permit 
hybridization only if a perfect match is found. See, e.g., Saiki, et al., 1986. Nature 324: 163; 
Saiki, et al., 1989. Proc. Natl. Acad. Sci. USA 86: 6230. Such allele specific oligonucleotides 
are hybridized to PCR amplified target DNA or a number of different mutations when the 
oligonucleotides are attached to the hybridizing membrane and hybridized with labeled target 
DNA. 
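
The idea of placing the known mutation centrally in an allele specific oligonucleotide, and 
accepting only a perfect match, can be sketched as below. The sketch makes several 
assumptions: stringent hybridization is modelled as an exact substring match, probes are 
written on the same strand as the target for readability (a physical probe would be the 
complementary strand), and the sequences, flank length and function names are hypothetical. 

```python
from typing import Tuple


def aso_probes(wild_type: str, position: int, mutant_base: str,
               flank: int = 9) -> Tuple[str, str]:
    """Build wild-type and mutant probes with the variable base at the center."""
    if position - flank < 0 or position + flank >= len(wild_type):
        raise ValueError("not enough flanking sequence around the position")
    left = wild_type[position - flank:position]
    right = wild_type[position + 1:position + flank + 1]
    return left + wild_type[position] + right, left + mutant_base + right


def genotype(target: str, wt_probe: str, mut_probe: str) -> str:
    """Classify a target by which probe finds a perfect match."""
    hits_wt, hits_mut = wt_probe in target, mut_probe in target
    if hits_wt and hits_mut:
        return "both"
    if hits_wt:
        return "wild-type"
    if hits_mut:
        return "mutant"
    return "no match"


# Hypothetical 19-base reference with a G-to-A change at position 9.
wild_type = "GGATCCTTAGCAGTCAAGG"
mutant = "GGATCCTTAACAGTCAAGG"
wt_probe, mut_probe = aso_probes(wild_type, position=9, mutant_base="A", flank=4)
print(genotype(wild_type, wt_probe, mut_probe), genotype(mutant, wt_probe, mut_probe))
```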

Alternatively, allele specific amplification technology that depends on selective PCR 
amplification may be used in conjunction with the instant invention. Oligonucleotides used as 
primers for specific amplification may carry the mutation of interest in the center of the 
molecule (so that amplification depends on differential hybridization; see, e.g., Gibbs, et al., 
1989. Nucl. Acids Res. 17: 2437-2448) or at the extreme 3'-terminus of one primer where, 
under appropriate conditions, mismatch can prevent, or reduce polymerase extension (see, e.g., 
Prossner, 1993. Tibtech. 11: 238). In addition it may be desirable to introduce a novel 
restriction site in the region of the mutation to create cleavage-based detection. See, e.g., 
Gasparini, et al., 1992. Mol. Cell Probes 6: 1. It is anticipated that in certain embodiments 
amplification may also be performed using Taq ligase for amplification. See, e.g., Barany, 
1991. Proc. Natl. Acad. Sci. USA 88: 189. In such cases, ligation will occur only if there is a 
perfect match at the 3'-terminus of the 5' sequence, making it possible to detect the presence of 
a known mutation at a specific site by looking for the presence or absence of amplification. 

The methods described herein may be performed, for example, by utilizing 
pre-packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
described herein, which may be conveniently used, e.g., in clinical settings to diagnose 
patients exhibiting symptoms or family history of a disease or illness involving an NOVX 
gene. 

Furthermore, any cell type or tissue, preferably peripheral blood leukocytes, in which 
NOVX is expressed may be utilized in the prognostic assays described herein. However, any 
biological sample containing nucleated cells may be used, including, for example, buccal 
mucosal cells. 

Pharmacogenomics 

Agents, or modulators that have a stimulatory or inhibitory effect on NOVX activity 
(e.g., NOVX gene expression), as identified by a screening assay described herein can be 
administered to individuals to treat (prophylactically or therapeutically) disorders (The 
disorders include metabolic disorders, diabetes, obesity, infectious disease, anorexia, 
cancer-associated cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's 
Disorder, immune disorders, and hematopoietic disorders, and the various dyslipidemias, 
metabolic disturbances associated with obesity, the metabolic syndrome X and wasting 
disorders associated with chronic diseases and various cancers.) In conjunction with such 
treatment, the pharmacogenomics (i.e., the study of the relationship between an individual's 
genotype and that individual's response to a foreign compound or drug) of the individual may 
be considered. Differences in metabolism of therapeutics can lead to severe toxicity or 
therapeutic failure by altering the relation between dose and blood concentration of the 
pharmacologically active drug. Thus, the pharmacogenomics of the individual permits the 
selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments based on a 
consideration of the individual's genotype. Such pharmacogenomics can further be used to 
determine appropriate dosages and therapeutic regimens. Accordingly, the activity of NOVX 
protein, expression of NOVX nucleic acid, or mutation content of NOVX genes in an 
individual can be determined to thereby select appropriate agent(s) for therapeutic or 
prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected persons. See 
e.g., Eichelbaum, 1996. Clin. Exp. Pharmacol. Physiol. 23: 983-985; Linder, 1997. Clin. 
Chem. 43: 254-266. In general, two types of pharmacogenetic conditions can be 
differentiated: genetic conditions transmitted as a single factor altering the way drugs act on 
the body (altered drug action) or genetic conditions transmitted as single factors altering the 
way the body acts on drugs (altered drug metabolism). These pharmacogenetic conditions can 
occur either as rare defects or as polymorphisms. For example, glucose-6-phosphate 
dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the main 
clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a major 
determinant of both the intensity and duration of drug action. The discovery of genetic 
polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 2) and 
cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation as to why 
some patients do not obtain the expected drug effects or show exaggerated drug response and 
serious toxicity after taking the standard and safe dose of a drug. These polymorphisms are 
expressed in two phenotypes in the population, the extensive metabolizer (EM) and poor 
metabolizer (PM). The prevalence of PM is different among different populations. For 
example, the gene coding for CYP2D6 is highly polymorphic and several mutations have been 
identified in PM, which all lead to the absence of functional CYP2D6. Poor metabolizers of 
CYP2D6 and CYP2C19 quite frequently experience exaggerated drug response and side 
effects when they receive standard doses. If a metabolite is the active therapeutic moiety, PM 
show no therapeutic response, as demonstrated for the analgesic effect of codeine mediated by 
its CYP2D6-formed metabolite morphine. At the other extreme are the so called ultra-rapid 
metabolizers who do not respond to standard doses. Recently, the molecular basis of 
ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification. 

Thus, the activity of NOVX protein, expression of NOVX nucleic acid, or mutation 
content of NOVX genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping of polymorphic alleles encoding 
drug-metabolizing enzymes to the identification of an individual's drug responsiveness 
phenotype. This knowledge, when applied to dosing or drug selection, can avoid adverse 
reactions or therapeutic failure and thus enhance therapeutic or prophylactic efficiency when 
treating a subject with an NOVX modulator, such as a modulator identified by one of the 
exemplary screening assays described herein. 

Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression or 
activity of NOVX (e.g., the ability to modulate aberrant cell proliferation and/or 
differentiation) can be applied not only in basic drug screening, but also in clinical trials. For 
example, the effectiveness of an agent determined by a screening assay as described herein to 
increase NOVX gene expression, protein levels, or upregulate NOVX activity, can be 
monitored in clinical trials of subjects exhibiting decreased NOVX gene expression, protein 
levels, or downregulated NOVX activity. Alternatively, the effectiveness of an agent 
determined by a screening assay to decrease NOVX gene expression, protein levels or 
downregulate NOVX activity, can be monitored in clinical trials of subjects exhibiting 
increased NOVX gene expression, protein levels, or upregulated NOVX activity. In such 
clinical trials, the expression or activity of NOVX and, preferably, other genes that have been 
implicated in, for example, a cellular proliferation or immune disorder can be used as a "read 
out" or markers of the immune responsiveness of a particular cell. 

By way of example, and not of limitation, genes, including NOVX, that are modulated 
in cells by treatment with an agent (e.g., compound, drug or small molecule) that modulates 
NOVX activity (e.g., identified in a screening assay as described herein) can be identified. 
Thus, to study the effect of agents on cellular proliferation disorders, for example, in a clinical 
trial, cells can be isolated and RNA prepared and analyzed for the levels of expression of 
NOVX and other genes implicated in the disorder. The levels of gene expression (i.e., a gene 
expression pattern) can be quantified by Northern blot analysis or RT-PCR, as described 
herein, or alternatively by measuring the amount of protein produced, by one of the methods 
as described herein, or by measuring the levels of activity of NOVX or other genes. In this 
manner, the gene expression pattern can serve as a marker, indicative of the physiological 
response of the cells to the agent. Accordingly, this response state may be determined before, 
and at various points during, treatment of the individual with the agent. 

In one embodiment, the invention provides a method for monitoring the effectiveness
of treatment of a subject with an agent (e.g., an agonist, antagonist, protein, peptide,
peptidomimetic, nucleic acid, small molecule, or other drug candidate identified by the
screening assays described herein) comprising the steps of (i) obtaining a pre-administration
sample from a subject prior to administration of the agent; (ii) detecting the level of expression
of an NOVX protein, mRNA, or genomic DNA in the pre-administration sample; (iii) obtaining
one or more post-administration samples from the subject; (iv) detecting the level of
expression or activity of the NOVX protein, mRNA, or genomic DNA in the
post-administration samples; (v) comparing the level of expression or activity of the NOVX
protein, mRNA, or genomic DNA in the pre-administration sample with the NOVX protein,
mRNA, or genomic DNA in the post-administration sample or samples; and (vi) altering the
administration of the agent to the subject accordingly. For example, increased administration
of the agent may be desirable to increase the expression or activity of NOVX to higher levels
than detected, i.e., to increase the effectiveness of the agent. Alternatively, decreased
administration of the agent may be desirable to decrease expression or activity of NOVX to
lower levels than detected, i.e., to decrease the effectiveness of the agent.

Methods of Treatment

The invention provides for both prophylactic and therapeutic methods of treating a
subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant
NOVX expression or activity. The disorders include cardiomyopathy, atherosclerosis,
hypertension, congenital heart defects, aortic stenosis, atrial septal defect (ASD),
atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis, subaortic stenosis,
ventricular septal defect (VSD), valve diseases, tuberous sclerosis, scleroderma, obesity,
transplantation, adrenoleukodystrophy, congenital adrenal hyperplasia, prostate cancer,
neoplasm, adenocarcinoma, lymphoma, uterus cancer, fertility, hemophilia, hypercoagulation,
idiopathic thrombocytopenic purpura, immunodeficiencies, graft versus host disease, AIDS,
bronchial asthma, Crohn's disease, multiple sclerosis, treatment of Albright Hereditary
Osteodystrophy, and other diseases, disorders and conditions of the like.
These methods of treatment will be discussed more fully, below.

Disease and Disorders 

Diseases and disorders that are characterized by increased (relative to a subject not
suffering from the disease or disorder) levels or biological activity may be treated with
Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that antagonize
activity may be administered in a therapeutic or prophylactic manner. Therapeutics that may
be utilized include, but are not limited to: (i) an aforementioned peptide, or analogs,
derivatives, fragments or homologs thereof; (ii) antibodies to an aforementioned peptide; (iii)
nucleic acids encoding an aforementioned peptide; (iv) administration of antisense nucleic acid
and nucleic acids that are "dysfunctional" (i.e., due to a heterologous insertion within the
coding sequences of an aforementioned peptide) that are utilized to
"knockout" endogenous function of an aforementioned peptide by homologous recombination
(see, e.g., Capecchi, 1989. Science 244: 1288-1292); or (v) modulators (i.e., inhibitors,
agonists and antagonists, including additional peptide mimetic of the invention or antibodies
specific to a peptide of the invention) that alter the interaction between an aforementioned
peptide and its binding partner.

Diseases and disorders that are characterized by decreased (relative to a subject not
suffering from the disease or disorder) levels or biological activity may be treated with
Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that upregulate activity
may be administered in a therapeutic or prophylactic manner. Therapeutics that may be
utilized include, but are not limited to, an aforementioned peptide, or analogs, derivatives,
fragments or homologs thereof; or an agonist that increases bioavailability.

Increased or decreased levels can be readily detected by quantifying peptide and/or
RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and assaying it in vitro for
RNA or peptide levels, structure and/or activity of the expressed peptides (or mRNAs of an
aforementioned peptide). Methods that are well-known within the art include, but are not
limited to, immunoassays (e.g., by Western blot analysis, immunoprecipitation followed by
sodium dodecyl sulfate (SDS) polyacrylamide gel electrophoresis, immunocytochemistry, etc.)
and/or hybridization assays to detect expression of mRNAs (e.g., Northern assays, dot blots, in
situ hybridization, and the like).

Prophylactic Methods 

In one aspect, the invention provides a method for preventing, in a subject, a disease or
condition associated with an aberrant NOVX expression or activity, by administering to the
subject an agent that modulates NOVX expression or at least one NOVX activity. Subjects at
risk for a disease that is caused or contributed to by aberrant NOVX expression or activity can
be identified by, for example, any or a combination of diagnostic or prognostic assays as
described herein. Administration of a prophylactic agent can occur prior to the manifestation
of symptoms characteristic of the NOVX aberrancy, such that a disease or disorder is
prevented or, alternatively, delayed in its progression. Depending upon the type of NOVX
aberrancy, for example, an NOVX agonist or NOVX antagonist agent can be used for treating
the subject. The appropriate agent can be determined based on screening assays described
herein. The prophylactic methods of the invention are further discussed in the following
subsections.

Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating NOVX expression
or activity for therapeutic purposes. The modulatory method of the invention involves
contacting a cell with an agent that modulates one or more of the activities of NOVX protein
activity associated with the cell. An agent that modulates NOVX protein activity can be an
agent as described herein, such as a nucleic acid or a protein, a naturally-occurring cognate
ligand of an NOVX protein, a peptide, an NOVX peptidomimetic, or other small molecule. In
one embodiment, the agent stimulates one or more NOVX protein activity. Examples of such
stimulatory agents include active NOVX protein and a nucleic acid molecule encoding NOVX
that has been introduced into the cell. In another embodiment, the agent inhibits one or more
NOVX protein activity. Examples of such inhibitory agents include antisense NOVX nucleic
acid molecules and anti-NOVX antibodies. These modulatory methods can be performed in
vitro (e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering
the agent to a subject). As such, the invention provides methods of treating an individual
afflicted with a disease or disorder characterized by aberrant expression or activity of an
NOVX protein or nucleic acid molecule. In one embodiment, the method involves
administering an agent (e.g., an agent identified by a screening assay described herein), or
combination of agents that modulates (e.g., up-regulates or down-regulates) NOVX expression
or activity. In another embodiment, the method involves administering an NOVX protein or
nucleic acid molecule as therapy to compensate for reduced or aberrant NOVX expression or
activity.

Stimulation of NOVX activity is desirable in situations in which NOVX is abnormally
downregulated and/or in which increased NOVX activity is likely to have a beneficial effect.
One example of such a situation is where a subject has a disorder characterized by aberrant
cell proliferation and/or differentiation (e.g., cancer or immune associated disorders). Another
example of such a situation is where the subject has a gestational disease (e.g., preeclampsia).

Determination of the Biological Effect of the Therapeutic

In various embodiments of the invention, suitable in vitro or in vivo assays are
performed to determine the effect of a specific Therapeutic and whether its administration is
indicated for treatment of the affected tissue.

In various specific embodiments, in vitro assays may be performed with representative
cells of the type(s) involved in the patient's disorder, to determine if a given Therapeutic exerts
the desired effect upon the cell type(s). Compounds for use in therapy may be tested in
suitable animal model systems including, but not limited to, rats, mice, chicken, cows,
monkeys, rabbits, and the like, prior to testing in human subjects. Similarly, for in vivo
testing, any of the animal model systems known in the art may be used prior to administration
to human subjects.

Prophylactic and Therapeutic Uses of the Compositions of the Invention

The NOVX nucleic acids and proteins of the invention are useful in potential
prophylactic and therapeutic applications implicated in a variety of disorders including, but not
limited to: metabolic disorders, diabetes, obesity, infectious disease, anorexia,
cancer-associated cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease,
Parkinson's Disorder, immune disorders, hematopoietic disorders, and the various
dyslipidemias, metabolic disturbances associated with obesity, the metabolic syndrome X and
wasting disorders associated with chronic diseases and various cancers.

As an example, a cDNA encoding the NOVX protein of the invention may be useful in
gene therapy, and the protein may be useful when administered to a subject in need thereof.
By way of non-limiting example, the compositions of the invention will have efficacy for
treatment of patients suffering from: metabolic disorders, diabetes, obesity, infectious disease,
anorexia, cancer-associated cachexia, cancer, neurodegenerative disorders, Alzheimer's
Disease, Parkinson's Disorder, immune disorders, hematopoietic disorders, and the various
dyslipidemias.

Both the novel nucleic acid encoding the NOVX protein, and the NOVX protein of the
invention, or fragments thereof, may also be useful in diagnostic applications, wherein the
presence or amount of the nucleic acid or the protein are to be assessed. A further use could
be as an anti-bacterial molecule (i.e., some peptides have been found to possess anti-bacterial
properties). These materials are further useful in the generation of antibodies which
immunospecifically bind to the novel substances of the invention for use in therapeutic or
diagnostic methods.

The invention will be further described in the following examples, which do not limit
the scope of the invention described in the claims.

Examples 

Example 1: Identification of NOVX Nucleic Acids

TblastN using CuraGen Corporation's sequence file for polypeptides or homologs was 
run against the Genomic Daily Files made available by GenBank or from files downloaded 
from the individual sequencing centers. Exons were predicted by homology and the 
intron/exon boundaries were determined using standard genetic rules. Exons were further 
selected and refined by means of similarity determination using multiple BLAST (for 
example, tBlastN, BlastX, and BlastN) searches, and, in some instances, GeneScan and Grail. 
Expressed sequences from both public and proprietary databases were also added when 
available to further define and complete the gene sequence. The DNA sequence was then 
manually corrected for apparent inconsistencies thereby obtaining the sequences encoding the
full-length protein.

The novel NOVX target sequences identified in the present invention were subjected to
the exon linking process to confirm the sequence. PCR primers were designed by starting at
the most upstream sequence available, for the forward primer, and at the most downstream
sequence available for the reverse primer. PCR primer sequences were used for obtaining
different clones. In each case, the sequence was examined, walking inward from the
respective termini toward the coding sequence, until a suitable sequence that is either unique
or highly selective was encountered, or, in the case of the reverse primer, until the stop codon
was reached. Such primers were designed based on in silico predictions for the full length
cDNA, part (one or more exons) of the DNA or protein sequence of the target sequence, or by
translated homology of the predicted exons to closely related human sequences or sequences
from other species. These primers were then employed in PCR amplification based on the
following pool of human cDNAs: adrenal gland, bone marrow, brain - amygdala, brain -
cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, brain - whole,
fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, mammary
gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, small
intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, uterus. Usually the resulting
amplicons were gel purified, cloned and sequenced to high redundancy. The PCR product
derived from exon linking was cloned into the pCR2.1 vector from Invitrogen. The resulting
bacterial clone has an insert covering the entire open reading frame cloned into the pCR2.1
vector. The resulting sequences from all clones were assembled with themselves, with other
fragments in CuraGen Corporation's database and with public ESTs. Fragments and ESTs
were included as components for an assembly when the extent of their identity with another
component of the assembly was at least 95% over 50 bp. In addition, sequence traces were
evaluated manually and edited for corrections if appropriate. These procedures provide the
sequence reported herein.
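
The inclusion criterion stated above (at least 95% identity over 50 bp) can be illustrated with a
minimal sketch. It assumes the two components have already been aligned to equal length with
"-" marking gaps, and it reads the criterion as "some 50-position window of the alignment has at
least 95% matching bases". The function name, its parameters and that reading are illustrative
assumptions and are not taken from the assembly software actually used.

```python
def meets_inclusion_criterion(aligned_a, aligned_b, min_identity=0.95, window=50):
    """Return True if any `window`-long stretch of the alignment has
    at least `min_identity` identical, non-gap positions.

    Hypothetical helper: both inputs are pre-aligned strings of equal
    length, with '-' used for alignment gaps.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    n = len(aligned_a)
    if n < window:
        return False  # overlap too short to satisfy the 50 bp requirement

    # 1 where the two sequences agree on a real base, 0 otherwise.
    matches = [
        1 if (x == y and x != "-") else 0
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
    ]

    # Sliding-window count of matches.
    running = sum(matches[:window])
    if running / window >= min_identity:
        return True
    for i in range(window, n):
        running += matches[i] - matches[i - window]
        if running / window >= min_identity:
            return True
    return False


def mutate(seq, positions):
    """Illustrative helper: introduce a mismatch at each listed position."""
    bases = list(seq)
    for p in positions:
        bases[p] = "C" if bases[p] == "A" else "A"
    return "".join(bases)


reference = "ACGT" * 15                      # 60 bp toy sequence
print(meets_inclusion_criterion(reference, mutate(reference, [5, 25])))
print(meets_inclusion_criterion(reference, mutate(reference, range(0, 60, 10))))
```

In this toy example the first comparison has two mismatches in the first 50 positions (96%
identity) and the second has a mismatch every tenth base (90% identity in every window).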

Physical clone: Exons were predicted by homology and the intron/exon boundaries
were determined using standard genetic rules. Exons were further selected and refined by
means of similarity determination using multiple BLAST (for example, tBlastN, BlastX, and
BlastN) searches, and, in some instances, GeneScan and Grail. Expressed sequences from both
public and proprietary databases were also added when available to further define and
complete the gene sequence. The DNA sequence was then manually corrected for apparent
inconsistencies thereby obtaining the sequences encoding the full-length protein.

Example 2: Identification of Single Nucleotide Polymorphisms in NOVX nucleic acid
sequences

Variant sequences are also included in this application. A variant sequence can include
a single nucleotide polymorphism (SNP). A SNP can, in some instances, be referred to as a
"cSNP" to denote that the nucleotide sequence containing the SNP originates as a cDNA. A
SNP can arise in several ways. For example, a SNP may be due to a substitution of one
nucleotide for another at the polymorphic site. Such a substitution can be either a transition or
a transversion. A SNP can also arise from a deletion of a nucleotide or an insertion of a
nucleotide, relative to a reference allele. In this case, the polymorphic site is a site at which
one allele bears a gap with respect to a particular nucleotide in another allele. SNPs occurring
within genes may result in an alteration of the amino acid encoded by the gene at the position
of the SNP. Intragenic SNPs may also be silent, when a codon including a SNP encodes the
same amino acid as a result of the redundancy of the genetic code. SNPs occurring outside the
region of a gene, or in an intron within a gene, do not result in changes in any amino acid
sequence of a protein but may result in altered regulation of the expression pattern. Examples
include alteration in temporal expression, physiological response regulation, cell type
expression regulation, intensity of expression, and stability of transcribed message.
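
The ways in which a SNP can arise, as described above, can be sketched in a few lines. The
function below is an illustrative assumption and not part of the described method: it takes a
reference allele and a variant allele as single characters, uses "-" to denote the gap left by
an insertion or deletion, and labels a substitution as a transition (purine to purine or
pyrimidine to pyrimidine) or a transversion (purine to pyrimidine or the reverse).

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_snp(ref, alt):
    """Classify a single-position variant relative to a reference allele.

    Hypothetical helper: `ref` and `alt` are single characters, with '-'
    standing for a gap in that allele.
    """
    ref, alt = ref.upper(), alt.upper()
    if ref == alt:
        raise ValueError("alleles are identical; no polymorphism")
    if ref == "-":
        return "insertion"      # variant has a base where the reference has a gap
    if alt == "-":
        return "deletion"       # variant lacks a base present in the reference
    bases = PURINES | PYRIMIDINES
    if ref not in bases or alt not in bases:
        raise ValueError("alleles must be A, C, G, T or '-'")
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"


for ref, alt in [("A", "G"), ("C", "T"), ("A", "C"), ("-", "T"), ("G", "-")]:
    print(ref, alt, classify_snp(ref, alt))
```

Deciding whether an intragenic substitution is silent would additionally require the reading
frame and a codon table, which this sketch leaves out.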

SeqCalling assemblies produced by the exon linking process were selected and 
extended using the following criteria. Genomic clones having regions with 98% identity to all 
or part of the initial or extended sequence were identified by BLASTN searches using the 
relevant sequence to query human genomic databases. The genomic clones that resulted were 
selected for further analysis because this identity indicates that these clones contain the
genomic locus for these SeqCalling assemblies. These sequences were analyzed for putative
coding regions as well as for similarity to the known DNA and protein sequences. Programs 
used for these analyses include Grail, Genscan, BLAST, HMMER, FASTA, Hybrid and other 
relevant programs. 

Some additional genomic regions may have also been identified because selected
SeqCalling assemblies map to those regions. Such SeqCalling sequences may have
overlapped with regions defined by homology or exon prediction. They may also be included
because the location of the fragment was in the vicinity of genomic regions identified by
similarity or exon prediction that had been included in the original predicted sequence. The
sequence so identified was manually assembled and then may have been extended using one
or more additional sequences taken from CuraGen Corporation's human SeqCalling database.
SeqCalling fragments suitable for inclusion were identified by the CuraTools™ program
SeqExtend or by identifying SeqCalling fragments mapping to the appropriate regions of the
genomic clones analyzed.

The regions defined by the procedures described above were then manually integrated
and corrected for apparent inconsistencies that may have arisen, for example, from miscalled
bases in the original fragments or from discrepancies between predicted exon junctions, EST
locations and regions of sequence similarity, to derive the final sequence disclosed herein.
When necessary, the process to identify and analyze SeqCalling assemblies and genomic
clones was reiterated to derive the full length sequence.

Example 3. Quantitative expression analysis of clones in various cells and tissues 

The quantitative expression of various clones was assessed using microtiter plates
containing RNA samples from a variety of normal and pathology-derived cells, cell lines and
tissues using real time quantitative PCR (RTQ PCR). RTQ PCR was performed on a
Perkin-Elmer Biosystems ABI PRISM® 7700 Sequence Detection System. Various collections
of samples are assembled on the plates, and referred to as Panel 1 (containing normal tissues
and cancer cell lines), Panel 2 (containing samples derived from tissues from normal and
cancer sources), Panel 3 (containing cancer cell lines), Panel 4 (containing cells and cell lines
from normal tissues and cells related to inflammatory conditions), Panel 5D/5I (containing
human tissues and cell lines with an emphasis on metabolic diseases), AI_comprehensive_panel
(containing normal tissue and samples from autoinflammatory diseases), Panel CNSD.01
(containing samples from normal and diseased brains) and CNS_neurodegeneration_panel
(containing samples from normal and Alzheimer's diseased brains).

RNA integrity from all samples is controlled for quality by visual assessment of
agarose gel electropherograms using 28S and 18S ribosomal RNA staining intensity ratio as a
guide (2:1 to 2.5:1 28S:18S) and the absence of low molecular weight RNAs that would be
indicative of degradation products. Samples are controlled against genomic DNA
contamination by RTQ PCR reactions run in the absence of reverse transcriptase using probe
and primer sets designed to amplify across the span of a single exon.

First, the RNA samples were normalized to reference nucleic acids such as
constitutively expressed genes (for example, β-actin and GAPDH). Normalized RNA (5 µl)
was converted to cDNA and analyzed by RTQ-PCR using One Step RT-PCR Master Mix
Reagents (PE Biosystems; Catalog No. 4309169) and gene-specific primers according to the
manufacturer's instructions. Probes and primers were designed for each assay according to
Perkin Elmer Biosystem's Primer Express Software package (version I for Apple Computer's
Macintosh Power PC) or a similar algorithm using the target sequence as input. Default
settings were used for reaction conditions and the following parameters were set before
selecting primers: primer concentration = 250 nM, primer melting temperature (Tm) range =
58°-60° C, primer optimal Tm = 59° C, maximum primer difference = 2° C, probe does not
have 5' G, probe Tm must be 10° C greater than primer Tm, amplicon size 75 bp to 100 bp.
The probes and primers selected (see below) were synthesized by Synthegen (Houston, TX,
USA). Probes were double purified by HPLC to remove uncoupled dye and evaluated by
mass spectroscopy to verify coupling of reporter and quencher dyes to the 5' and 3' ends of
the probe, respectively. Their final concentrations were: forward and reverse primers, 900 nM
each, and probe, 200 nM.
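
The design parameters listed above can be read as a checklist applied to each candidate
primer/probe set. The sketch below is illustrative only and does not reproduce the Primer
Express algorithm: it assumes the melting temperatures have already been computed by some other
tool, interprets "10° C greater than primer Tm" as at least 10° C above the higher of the two
primer Tm values, and uses a hypothetical AssayDesign record and design_violations function.

```python
from dataclasses import dataclass


@dataclass
class AssayDesign:
    """Hypothetical record for one candidate primer/probe set."""
    forward_tm: float    # forward primer Tm, degrees C (precomputed)
    reverse_tm: float    # reverse primer Tm, degrees C (precomputed)
    probe_tm: float      # probe Tm, degrees C (precomputed)
    probe_seq: str       # probe sequence, written 5' to 3'
    amplicon_len: int    # amplicon size in bp


def design_violations(d):
    """Return a list of the stated design rules that the candidate breaks."""
    problems = []
    for name, tm in (("forward", d.forward_tm), ("reverse", d.reverse_tm)):
        if not 58.0 <= tm <= 60.0:
            problems.append(f"{name} primer Tm outside 58-60 C")
    if abs(d.forward_tm - d.reverse_tm) > 2.0:
        problems.append("primer Tm difference exceeds 2 C")
    if d.probe_seq.upper().startswith("G"):
        problems.append("probe has a 5' G")
    if d.probe_tm - max(d.forward_tm, d.reverse_tm) < 10.0:
        problems.append("probe Tm less than 10 C above primer Tm")
    if not 75 <= d.amplicon_len <= 100:
        problems.append("amplicon size outside 75-100 bp")
    return problems


candidate = AssayDesign(59.0, 58.5, 69.5, "CATGGTCAAGCTGACCATTGCA", 84)
print(design_violations(candidate))
```

Returning the list of broken rules, rather than a single pass/fail flag, makes it easier to see
why a candidate set was rejected.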

PCR conditions: Normalized RNA from each tissue and each cell line was spotted in
each well of a 96 well PCR plate (Perkin Elmer Biosystems). PCR cocktails including two
probes (a probe specific for the target clone and another gene-specific probe multiplexed with
the target probe) were set up using 1X TaqMan™ PCR Master Mix for the PE Biosystems
7700, with 5 mM MgCl2, dNTPs (dA, G, C, U at 1:1:1:2 ratios), 0.25 U/ml AmpliTaq Gold™
(PE Biosystems), 0.4 U/µl RNase inhibitor, and 0.25 U/µl reverse transcriptase. Reverse
transcription was performed at 48° C for 30 minutes followed by amplification/PCR cycles as
follows: 95° C 10 min, then 40 cycles of 95° C for 15 seconds, 60° C for 1 minute. Results
were recorded as CT values (cycle at which a given sample crosses a threshold level of
fluorescence) using a log scale, with the difference in RNA concentration between a given
sample and the sample with the lowest CT value being represented as 2 to the power of delta
CT. The percent relative expression is then obtained by taking the reciprocal of this RNA
difference and multiplying by 100.
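
The calculation described above can be written out as a short worked sketch. For each sample,
delta CT is its CT value minus the lowest CT value on the plate, the RNA difference is 2 raised
to delta CT, and the percent relative expression is 100 divided by that difference. The function
name and the sample labels are illustrative assumptions, and the CT values are made-up numbers
rather than data from the panels.

```python
def percent_relative_expression(ct_values):
    """Map each sample's CT value to percent relative expression.

    The sample with the lowest CT (highest expression) is assigned 100%;
    each additional cycle needed to reach threshold halves the value.
    """
    if not ct_values:
        raise ValueError("no CT values supplied")
    ct_min = min(ct_values.values())
    result = {}
    for sample, ct in ct_values.items():
        delta_ct = ct - ct_min            # cycles behind the strongest sample
        rna_difference = 2.0 ** delta_ct  # fold difference in RNA concentration
        result[sample] = 100.0 / rna_difference
    return result


# Toy CT values for three hypothetical wells.
example = {"well_A": 20.0, "well_B": 21.0, "well_C": 23.0}
for sample, pct in percent_relative_expression(example).items():
    print(sample, pct)
```

With these toy inputs the strongest well is set to 100%, a well one cycle later to 50%, and a
well three cycles later to 12.5%.
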
Panels 1, 1.1, 1.2, and 1.3D 

The plates for Panels 1, 1.1, 1.2 and 1.3D include 2 control wells (genomic DNA
control and chemistry control) and 94 wells containing cDNA from various samples. The
samples in these panels are broken into 2 classes: samples derived from cultured cell lines and
samples derived from primary normal tissues. The cell lines are derived from cancers of the
following types: lung cancer, breast cancer, melanoma, colon cancer, prostate cancer, CNS
cancer, squamous cell carcinoma, ovarian cancer, liver cancer, renal cancer, gastric cancer and
pancreatic cancer. Cell lines used in these panels are widely available through the American
Type Culture Collection (ATCC), a repository for cultured cell lines, and were cultured using
the conditions recommended by the ATCC. The normal tissues found on these panels are
comprised of samples derived from all major organ systems from single adult individuals or
fetuses. These samples are derived from the following organs: adult skeletal muscle, fetal
skeletal muscle, adult heart, fetal heart, adult kidney, fetal kidney, adult liver, fetal liver, adult
lung, fetal lung, various regions of the brain, the spleen, bone marrow, lymph node, pancreas,
salivary gland, pituitary gland, adrenal gland, spinal cord, thymus, stomach, small intestine,
colon, bladder, trachea, breast, ovary, uterus, placenta, prostate, testis and adipose.
In the results for Panels 1, 1.1, 1.2 and 1.3D, the following abbreviations are used:
ca. = carcinoma,
* = established from metastasis,
met = metastasis,
s cell var = small cell variant,
non-s = non-sm = non-small,
squam = squamous,
pl. eff = pl effusion = pleural effusion,
glio = glioma,
astro = astrocytoma, and
neuro = neuroblastoma.

General_screening_panel_v1.4

The plates for Panel 1.4 include 2 control wells (genomic DNA control and chemistry
control) and 94 wells containing cDNA from various samples. The samples in Panel 1.4 are
broken into 2 classes: samples derived from cultured cell lines and samples derived from
primary normal tissues. The cell lines are derived from cancers of the following types: lung
cancer, breast cancer, melanoma, colon cancer, prostate cancer, CNS cancer, squamous cell
carcinoma, ovarian cancer, liver cancer, renal cancer, gastric cancer and pancreatic cancer.
Cell lines used in Panel 1.4 are widely available through the American Type Culture
Collection (ATCC), a repository for cultured cell lines, and were cultured using the conditions
recommended by the ATCC. The normal tissues found on Panel 1.4 are comprised of pools of
samples derived from all major organ systems from 2 to 5 different adult individuals or
fetuses. These samples are derived from the following organs: adult skeletal muscle, fetal
skeletal muscle, adult heart, fetal heart, adult kidney, fetal kidney, adult liver, fetal liver, adult
lung, fetal lung, various regions of the brain, the spleen, bone marrow, lymph node, pancreas,
salivary gland, pituitary gland, adrenal gland, spinal cord, thymus, stomach, small intestine,
colon, bladder, trachea, breast, ovary, uterus, placenta, prostate, testis and adipose.

Panels 2D and 2.2

The plates for Panels 2D and 2.2 generally include 2 control wells and 94 test samples
composed of RNA or cDNA isolated from human tissue procured by surgeons working in
close cooperation with the National Cancer Institute's Cooperative Human Tissue Network
(CHTN) or the National Disease Research Interchange (NDRI). The tissues are derived from
human malignancies and in cases where indicated many malignant tissues have "matched
margins" obtained from noncancerous tissue just adjacent to the tumor. These are termed
normal adjacent tissues and are denoted "NAT" in the results below. The tumor tissue and the
"matched margins" are evaluated by two independent pathologists (the surgical pathologists
and again by a pathologist at NDRI or CHTN). This analysis provides a gross
histopathological assessment of tumor differentiation grade. Moreover, most samples include
the original surgical pathology report that provides information regarding the clinical stage of
the patient. These matched margins are taken from the tissue surrounding (i.e. immediately
proximal) to the zone of surgery (designated "NAT", for normal adjacent tissue, in Table RR).
In addition, RNA and cDNA samples were obtained from various human tissues derived from
autopsies performed on elderly people or sudden death victims (accidents, etc.). These tissues
were ascertained to be free of disease and were purchased from various commercial sources
such as Clontech (Palo Alto, CA), Research Genetics, and Invitrogen.

Panel 3D 

The plates of Panel 3D are comprised of 94 cDNA samples and two control samples. 
Specifically, 92 of these samples are derived from cultured human cancer cell lines, 2 samples 
of human primary cerebellar tissue and 2 controls. The human cell lines are generally 
obtained from ATCC (American Type Culture Collection), NCI or the German tumor cell 
bank and fall into the following tissue groups: squamous cell carcinoma of the tongue, breast
cancer, prostate cancer, melanoma, epidermoid carcinoma, sarcomas, bladder carcinomas,
pancreatic cancers, kidney cancers, leukemias/lymphomas, ovarian/uterine/cervical, gastric,
colon, lung and CNS cancer cell lines. In addition, there are two independent samples of
cerebellum. These cells are all cultured under standard recommended conditions and RNA
extracted using the standard procedures. The cell lines in Panels 3D and 1.3D are among the
most common cell lines used in the scientific literature.

Panels 4D, 4R, and 4.1D 

Panel 4 includes samples on a 96 well plate (2 control wells, 94 test samples)
composed of RNA (Panel 4R) or cDNA (Panels 4D/4.1D) isolated from various human cell
lines or tissues related to inflammatory conditions. Total RNA from control normal tissues
such as colon and lung (Stratagene, La Jolla, CA) and thymus and kidney (Clontech) was
employed. Total RNA from liver tissue from cirrhosis patients and kidney from lupus patients
was obtained from BioChain (Biochain Institute, Inc., Hayward, CA). Intestinal tissue for
RNA preparation from patients diagnosed as having Crohn's disease and ulcerative colitis was
obtained from the National Disease Research Interchange (NDRI) (Philadelphia, PA).

Astrocytes, lung fibroblasts, dermal fibroblasts, coronary artery smooth muscle cells,
small airway epithelium, bronchial epithelium, microvascular dermal endothelial cells,
microvascular lung endothelial cells, human pulmonary aortic endothelial cells, human
umbilical vein endothelial cells were all purchased from Clonetics (Walkersville, MD) and
grown in the media supplied for these cell types by Clonetics. These primary cell types were
activated with various cytokines or combinations of cytokines for 6 and/or 12-14 hours, as
indicated. The following cytokines were used: IL-1 beta at approximately 1-5 ng/ml, TNF
alpha at approximately 5-10 ng/ml, IFN gamma at approximately 20-50 ng/ml, IL-4 at
approximately 5-10 ng/ml, IL-9 at approximately 5-10 ng/ml, IL-13 at approximately 5-10
ng/ml. Endothelial cells were sometimes starved for various times by culture in the basal
media from Clonetics with 0.1% serum.

Mononuclear cells were prepared from blood of employees at CuraGen Corporation, 
using Ficoll. LAK cells were prepared from these cells by culture in DMEM 5% FCS 
(Hyclone), 100 µM non essential amino acids (Gibco/Life Technologies, Rockville, MD), 1 
mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10^-5 M (Gibco), and 10 mM Hepes 
(Gibco) and Interleukin 2 for 4-6 days. Cells were then either activated with 10-20 ng/ml 
PMA and 1-2 µg/ml ionomycin, IL-12 at 5-10 ng/ml, IFN gamma at 20-50 ng/ml and IL-18 at 
5-10 ng/ml for 6 hours. In some cases, mononuclear cells were cultured for 4-5 days in 
DMEM 5% FCS (Hyclone), 100 µM non essential amino acids (Gibco), 1 mM sodium 
pyruvate (Gibco), mercaptoethanol 5.5 x 10^-5 M (Gibco), and 10 mM Hepes (Gibco) with 
PHA (phytohemagglutinin) or PWM (pokeweed mitogen) at approximately 5 µg/ml. Samples 
were taken at 24, 48 and 72 hours for RNA preparation. MLR (mixed lymphocyte reaction) 
samples were obtained by taking blood from two donors, isolating the mononuclear cells using 
Ficoll and mixing the isolated mononuclear cells 1:1 at a final concentration of approximately 
2x10^ cells/ml in DMEM 5% FCS (Hyclone), 100 µM non essential amino acids (Gibco), 1 
mM sodium pyruvate (Gibco), mercaptoethanol (5.5 x 10^-5 M) (Gibco), and 10 mM Hepes 
(Gibco). The MLR was cultured and samples taken at various time points ranging from 1 ^ 
20 days for RNA preparation. 

Monocytes were isolated from mononuclear cells using CD14 Miltenyi Beads, +ve VS 
selection columns and a Vario Magnet according to the manufacturer's instructions. 
Monocytes were differentiated into dendritic cells by culture in DMEM 5% fetal calf serum 
(FCS) (Hyclone, Logan, UT), 100 µM non essential amino acids (Gibco), 1 mM sodium 
pyruvate (Gibco), mercaptoethanol 5.5 x 10^-5 M (Gibco), and 10 mM Hepes (Gibco), 50 ng/ml 
GMCSF and 5 ng/ml IL-4 for 5-7 days. Macrophages were prepared by culture of monocytes 
for 5-7 days in DMEM 5% FCS (Hyclone), 100 µM non essential amino acids (Gibco), 1 mM 
sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10^-5 M (Gibco), 10 mM Hepes (Gibco) and 
10% AB Human Serum or MCSF at approximately 50 ng/ml. Monocytes, macrophages and 
dendritic cells were stimulated for 6 and 12-14 hours with lipopolysaccharide (LPS) at 100 
ng/ml. Dendritic cells were also stimulated with anti-CD40 monoclonal antibody 
(Pharmingen) at 10 ng/ml for 6 and 12-14 hours. 

CD4 lymphocytes, CD8 lymphocytes and NK cells were also isolated from 
mononuclear cells using CD4, CD8 and CD56 Miltenyi beads, positive VS selection columns 
and a Vario Magnet according to the manufacturer's instructions. CD45RA and CD45RO CD4 
lymphocytes were isolated by depleting mononuclear cells of CD8, CD56, CD14 and CD19 
cells using CD8, CD56, CD14 and CD19 Miltenyi beads and positive selection. Then 
CD45RO beads were used to isolate the CD45RO CD4 lymphocytes with the remaining cells 
being CD45RA CD4 lymphocytes. CD45RA CD4, CD45RO CD4 and CD8 lymphocytes 
were placed in DMEM 5% FCS (Hyclone), 100 µM non essential amino acids (Gibco), 1 mM 
sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10^-5 M (Gibco), and 10 mM Hepes (Gibco) 
and plated at 10^ cells/ml onto Falcon 6 well tissue culture plates that had been coated 
overnight with 0.5 µg/ml anti-CD28 (Pharmingen) and 3 µg/ml anti-CD3 (OKT3, ATCC) in 
PBS. After 6 and 24 hours, the cells were harvested for RNA preparation. To prepare 
chronically activated CD8 lymphocytes, we activated the isolated CD8 lymphocytes for 4 days 
on anti-CD28 and anti-CD3 coated plates and then harvested the cells and expanded them in 
DMEM 5% FCS (Hyclone), 100 µM non essential amino acids (Gibco), 1 mM sodium 
pyruvate (Gibco), mercaptoethanol 5.5 x 10^-5 M (Gibco), and 10 mM Hepes (Gibco) and IL-2. 
The expanded CD8 cells were then activated again with plate bound anti-CD3 and anti-CD28 
for 4 days and expanded as before. RNA was isolated 6 and 24 hours after the second 
activation and after 4 days of the second expansion culture. The isolated NK cells were 
cultured in DMEM 5% FCS (Hyclone), 100 µM non essential amino acids (Gibco), 1 mM 
sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10^-5 M (Gibco), and 10 mM Hepes (Gibco) 
and IL-2 for 4-6 days before RNA was prepared. 

To obtain B cells, tonsils were procured from NDRI. The tonsil was cut up with sterile 
dissecting scissors and then passed through a sieve. Tonsil cells were then spun down and 
resuspended at 10^ cells/ml in DMEM 5% FCS (Hyclone), 100 µM non essential amino acids 
(Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10^-5 M (Gibco), and 10 mM 
Hepes (Gibco). To activate the cells, we used PWM at 5 µg/ml or anti-CD40 (Pharmingen) at 
approximately 10 µg/ml and IL-4 at 5-10 ng/ml. Cells were harvested for RNA preparation at 
24, 48 and 72 hours. 

To prepare the primary and secondary Th1/Th2 and Tr1 cells, six-well Falcon plates 
were coated overnight with 10 µg/ml anti-CD28 (Pharmingen) and 2 µg/ml OKT3 (ATCC), 
and then washed twice with PBS. Umbilical cord blood CD4 lymphocytes (Poietic Systems, 
Germantown, MD) were cultured at 10^-10^ cells/ml in DMEM 5% FCS (Hyclone), 100 µM 
non essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 
10^-5 M (Gibco), 10 mM Hepes (Gibco) and IL-2 (4 ng/ml). IL-12 (5 ng/ml) and anti-IL4 (1 
µg/ml) were used to direct to Th1, while IL-4 (5 ng/ml) and anti-IFN gamma (1 µg/ml) were 
used to direct to Th2, and IL-10 at 5 ng/ml was used to direct to Tr1. After 4-5 days, the 
activated Th1, Th2 and Tr1 lymphocytes were washed once in DMEM and expanded for 4-7 
days in DMEM 5% FCS (Hyclone), 100 µM non essential amino acids (Gibco), 1 mM sodium 
pyruvate (Gibco), mercaptoethanol 5.5 x 10^-5 M (Gibco), 10 mM Hepes (Gibco) and IL-2 (1 
ng/ml). Following this, the activated Th1, Th2 and Tr1 lymphocytes were re-stimulated for 5 
days with anti-CD28/OKT3 and cytokines as described above, but with the addition of 
anti-CD95L (1 ng/ml) to prevent apoptosis. After 4-5 days, the Th1, Th2 and Tr1 lymphocytes 
were washed and then expanded again with IL-2 for 4-7 days. Activated Th1 and Th2 
lymphocytes were maintained in this way for a maximum of three cycles. RNA was prepared 
from primary and secondary Th1, Th2 and Tr1 after 6 and 24 hours following the second and 
third activations with plate bound anti-CD3 and anti-CD28 mAbs and 4 days into the second 
and third expansion cultures in Interleukin 2. 

The following leukocyte cell lines were obtained from the ATCC: Ramos, EOL-1, 
KU-812. EOL cells were further differentiated by culture in 0.1 mM dbcAMP at 5 x 10^ 
cells/ml for 8 days, changing the media every 3 days and adjusting the cell concentration to 5 
x 10^ cells/ml. For the culture of these cells, we used DMEM or RPMI (as recommended by 
the ATCC), with the addition of 5% FCS (Hyclone), 100 µM non essential amino acids 
(Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10^-5 M (Gibco), 10 mM 
Hepes (Gibco). RNA was either prepared from resting cells or cells activated with PMA at 10 
ng/ml and ionomycin at 1 µg/ml for 6 and 14 hours. Keratinocyte line CCD1106 and an airway 
epithelial tumor line NCI-H292 were also obtained from the ATCC. Both were cultured in 
DMEM 5% FCS (Hyclone), 100 µM non essential amino acids (Gibco), 1 mM sodium 
pyruvate (Gibco), mercaptoethanol 5.5 x 10^-5 M (Gibco), and 10 mM Hepes (Gibco). 
CCD1106 cells were activated for 6 and 14 hours with approximately 5 ng/ml TNF alpha and 
1 ng/ml IL-1 beta, while NCI-H292 cells were activated for 6 and 14 hours with the following 
cytokines: 5 ng/ml IL-4, 5 ng/ml IL-9, 5 ng/ml IL-13 and 25 ng/ml IFN gamma. 

For these cell lines and blood cells, RNA was prepared by lysing approximately 10^ 
cells/ml using Trizol (Gibco BRL). Briefly, 1/10 volume of bromochloropropane (Molecular 
Research Corporation) was added to the RNA sample, vortexed and after 10 minutes at room 
temperature, the tubes were spun at 14,000 rpm in a Sorvall SS34 rotor. The aqueous phase 
was removed and placed in a 15 ml Falcon Tube. An equal volume of isopropanol was added 
and left at -20 degrees C overnight. The precipitated RNA was spun down at 9,000 rpm for 
15 min in a Sorvall SS34 rotor and washed in 70% ethanol. The pellet was redissolved in 300 
µl of RNAse-free water and 35 µl buffer (Promega), 5 µl DTT, 7 µl RNAsin and 8 µl DNAse 
were added. The tube was incubated at 37 degrees C for 30 minutes to remove contaminating 
genomic DNA, extracted once with phenol chloroform and re-precipitated with 1/10 volume 
of 3 M sodium acetate and 2 volumes of 100% ethanol. The RNA was spun down and placed 
in RNAse free water. RNA was stored at -80 degrees C. 
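
The re-precipitation ratios stated above (1/10 volume of 3 M sodium acetate, 2 volumes of 100% ethanol) can be turned into reagent volumes with a short calculation. The following is only an illustrative sketch: the function name is hypothetical, and it assumes that both ratios are taken relative to the starting sample volume, which the text does not specify.

```python
def ethanol_precipitation_volumes(sample_ul: float) -> dict:
    """Reagent volumes (in microliters) for re-precipitating an RNA sample.

    Ratios follow the text: 1/10 volume of 3 M sodium acetate and
    2 volumes of 100% ethanol. ASSUMPTION: both ratios are computed
    against the starting sample volume, not the sample plus acetate.
    """
    if sample_ul <= 0:
        raise ValueError("sample volume must be positive")
    acetate_ul = sample_ul / 10.0   # 1/10 volume of 3 M sodium acetate
    ethanol_ul = 2.0 * sample_ul    # 2 volumes of 100% ethanol
    return {
        "sodium_acetate_3M_ul": acetate_ul,
        "ethanol_100pct_ul": ethanol_ul,
        "total_ul": sample_ul + acetate_ul + ethanol_ul,
    }


if __name__ == "__main__":
    # Hypothetical 300 microliter sample, chosen only for illustration.
    for reagent, volume in ethanol_precipitation_volumes(300.0).items():
        print(f"{reagent}: {volume:.1f}")
```

The total volume is reported so that the tube size can be checked before the reagents are added.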

AI_comprehensive panel_v1.0 

The plates for AI_comprehensive panel_v1.0 include two control wells and 89 test 
samples comprised of cDNA isolated from surgical and postmortem human tissues obtained 
from the Backus Hospital and Clinomics (Frederick, MD). Total RNA was extracted from 
tissue samples from the Backus Hospital in the Facility at CuraGen. Total RNA from other 
tissues was obtained from Clinomics. 

Joint tissues including synovial fluid, synovium, bone and cartilage were obtained from 
patients undergoing total knee or hip replacement surgery at the Backus Hospital. Tissue 
samples were immediately snap frozen in liquid nitrogen to ensure that isolated RNA was of 
optimal quality and not degraded. Additional samples of osteoarthritis and rheumatoid 
arthritis joint tissues were obtained from Clinomics. Normal control tissues were supplied by 
Clinomics and were obtained during autopsy of trauma victims. 

Surgical specimens of psoriatic tissues and adjacent matched tissues were provided as 
total RNA by Clinomics. Two male and two female patients were selected between the ages 
of 25 and 47. None of the patients were taking prescription drugs at the time samples were 
isolated. 

Surgical specimens of diseased colon from patients with ulcerative colitis and Crohn's 
disease and adjacent matched tissues were obtained from Clinomics. Bowel tissue from three 
female and three male Crohn's patients between the ages of 41-69 was used. Two patients 
were not on prescription medication while the others were taking dexamethasone, 
phenobarbital, or tylenol. Ulcerative colitis tissue was from three male and four female 
patients. Four of the patients were taking lebvid and two were on phenobarbital. 

Total RNA from post mortem lung tissue from trauma victims was purchased from 
Clinomics. Emphysema patients ranged in age from 40-70 and all were smokers; this age 
range was chosen to focus on patients with cigarette-linked emphysema and to avoid those 
patients with alpha-1 anti-trypsin deficiencies. Asthma patients ranged in age from 36-75, and 
smokers were excluded to avoid those patients that could also have COPD. COPD patients 
ranged in age from 35-80 and included both smokers and non-smokers. Most patients were 
taking corticosteroids and bronchodilators. 

In the labels employed to identify tissues in the AI_comprehensive panel_v1.0 panel, 
the following abbreviations are used: 

Syn = Synovial 
Normal = No apparent disease 
Rep22 /Rep20 = individual patients 
RA = Rheumatoid arthritis 
Backus = From Backus Hospital 
OA = Osteoarthritis 
(SS) (BA) (MF) = Individual patients 
Adj = Adjacent tissue 
Match control = adjacent tissues 
-M = Male 
-F = Female 
COPD = Chronic obstructive pulmonary disease 

Panels 5D and 5I 

The plates for Panel 5D and 5I include two control wells and a variety of cDNAs 
isolated from human tissues and cell lines with an emphasis on metabolic diseases. Metabolic 
tissues were obtained from patients enrolled in the Gestational Diabetes study. Cells were 
obtained during different stages in the differentiation of adipocytes from human mesenchymal 
stem cells. Human pancreatic islets were also obtained. 

In the Gestational Diabetes study subjects are young (18 - 40 years), otherwise healthy 
women with and without gestational diabetes undergoing routine (elective) Caesarean section. 
After delivery of the infant, when the surgical incisions were being repaired/closed, the 
obstetrician removed a small sample (<1 cc) of the exposed metabolic tissues during the 
closure of each surgical level. The biopsy material was rinsed in sterile saline, blotted and fast 
frozen within 5 minutes from the time of removal. The tissue was then flash frozen in liquid 
nitrogen and stored, individually, in sterile screw-top tubes and kept on dry ice for shipment to 
or to be picked up by CuraGen. The metabolic tissues of interest include uterine wall (smooth 
muscle), visceral adipose, skeletal muscle (rectus) and subcutaneous adipose. Patient 
descriptions are as follows: 

Patient 2 Diabetic Hispanic, overweight, not on insulin 
Patient 7-9 Nondiabetic Caucasian and obese (BMI>30) 
Patient 10 Diabetic Hispanic, overweight, on insulin 
Patient 11 Nondiabetic African American and overweight 
Patient 12 Diabetic Hispanic on insulin 

Adipocyte differentiation was induced in donor progenitor cells obtained from Osirus (a 
division of Clonetics/BioWhittaker) in triplicate except for Donor 3U which had only two 
replicates. Scientists at Clonetics isolated, grew and differentiated human mesenchymal stem 
cells (HuMSCs) for CuraGen based on the published protocol found in Mark F. Pittenger, et 
al., Multilineage Potential of Adult Human Mesenchymal Stem Cells Science Apr 2 1999: 
143-147. Clonetics provided Trizol lysates or frozen pellets suitable for mRNA isolation and 
ds cDNA production. A general description of each donor is as follows: 

Donor 2 and 3 U    Mesenchymal Stem cells    Undifferentiated Adipose 
Donor 2 and 3 AM   Adipose                   Adipose Midway Differentiated 
Donor 2 and 3 AD   Adipose                   Adipose Differentiated 






Human cell lines were generally obtained from ATCC (American Type Culture 
Collection), NCI or the German tumor cell bank and fall into the following tissue groups: 
kidney proximal convoluted tubule, uterine smooth muscle cells, small intestine, liver HepG2 
cancer cells, heart primary stromal cells, and adrenal cortical adenoma cells. These cells are all 
cultured under standard recommended conditions and RNA extracted using the standard 
procedures. All samples were processed at CuraGen to produce single stranded cDNA. 

Panel 5I contains all samples previously described with the addition of pancreatic islets 
from a 58 year old female patient obtained from the Diabetes Research Institute at the 
University of Miami School of Medicine. Islet tissue was processed to total RNA at an 
outside source and delivered to CuraGen for addition to panel 5I. 

In the labels employed to identify tissues in the 5D and 5I panels, the following 
abbreviations are used: 

GO Adipose = Greater Omentum Adipose 
SK = Skeletal Muscle 
UT = Uterus 
PL = Placenta 
AD = Adipose Differentiated 
AM = Adipose Midway Differentiated 
U = Undifferentiated Stem Cells 

Panel CNSD.01 

The plates for Panel CNSD.01 include two control wells and 94 test samples 
comprised of cDNA isolated from postmortem human brain tissue obtained from the Harvard 
Brain Tissue Resource Center. Brains are removed from calvaria of donors between 4 and 24 
hours after death, sectioned by neuroanatomists, and frozen at -80°C in liquid nitrogen vapor. 
All brains are sectioned and examined by neuropathologists to confirm diagnoses with clear 
associated neuropathology. 

Disease diagnoses are taken from patient records. The panel contains two brains from 
each of the following diagnoses: Alzheimer's disease, Parkinson's disease, Huntington's 
disease, Progressive Supranuclear Palsy, Depression, and "Normal controls". Within each of 
these brains, the following regions are represented: cingulate gyrus, temporal pole, globus 
pallidus, substantia nigra, Brodmann Area 4 (primary motor strip), Brodmann Area 7 (parietal 
cortex), Brodmann Area 9 (prefrontal cortex), and Brodmann Area 17 (occipital cortex). Not all 
brain regions are represented in all cases; e.g., Huntington's disease is characterized in part by 
neurodegeneration in the globus pallidus, thus this region is impossible to obtain from 
confirmed Huntington's cases. Likewise Parkinson's disease is characterized by degeneration 
of the substantia nigra, making this region more difficult to obtain. Normal control brains were 
examined for neuropathology and found to be free of any pathology consistent with 
neurodegeneration. 

In the labels employed to identify tissues in the CNS panel, the following abbreviations 
are used: 

PSP = Progressive supranuclear palsy 
Sub Nigra = Substantia nigra 
Glob Palladus = Globus pallidus 
Temp Pole = Temporal pole 
Cing Gyr = Cingulate gyrus 
BA 4 = Brodmann Area 4 

Panel CNS_Neurodegeneration_V1.0 

The plates for Panel CNS_Neurodegeneration_V1.0 include two control wells and 47 
test samples comprised of cDNA isolated from postmortem human brain tissue obtained from 
the Harvard Brain Tissue Resource Center (McLean Hospital) and the Human Brain and 
Spinal Fluid Resource Center (VA Greater Los Angeles Healthcare System). Brains are 
removed from calvaria of donors between 4 and 24 hours after death, sectioned by 
neuroanatomists, and frozen at -80°C in liquid nitrogen vapor. All brains are sectioned and 
examined by neuropathologists to confirm diagnoses with clear associated neuropathology. 

Disease diagnoses are taken from patient records. The panel contains six brains from 
Alzheimer's disease (AD) patients, and eight brains from "Normal controls" who showed no 
evidence of dementia prior to death. The eight normal control brains are divided into two 
categories: controls with no dementia and no Alzheimer's like pathology (Controls) and 
controls with no dementia but evidence of severe Alzheimer's like pathology (specifically 
senile plaque load rated as level 3 on a scale of 0-3; 0 = no evidence of plaques, 3 = severe AD 
senile plaque load). Within each of these brains, the following regions are represented: 
hippocampus, temporal cortex (Brodmann Area 21), parietal cortex (Brodmann Area 7), and 
occipital cortex (Brodmann Area 17). These regions were chosen to encompass all levels of 
neurodegeneration in AD. The hippocampus is a region of early and severe neuronal loss in 
AD; the temporal cortex is known to show neurodegeneration in AD after the hippocampus; 
the parietal cortex shows moderate neuronal death in the late stages of the disease; the 
occipital cortex is spared in AD and therefore acts as a "control" region within AD patients. 
Not all brain regions are represented in all cases. 

In the labels employed to identify tissues in the CNS_Neurodegeneration_V1.0 panel, 
the following abbreviations are used: 

AD = Alzheimer's disease brain; patient was demented and showed AD-like pathology 
upon autopsy 
Control = Control brains; patient not demented, showing no neuropathology 
Control (Path) = Control brains; patient not demented but showing severe AD-like 
pathology 
SupTemporal Ctx = Superior Temporal Cortex 
Inf Temporal Ctx = Inferior Temporal Cortex 

NOV1: Calpain-like 

Expression of the NOV1 gene (also referred to as 3352274) was assessed using the 
primer-probe set Ag2003 described in Table 12. Results from RTQ-PCR runs are shown in 
Tables 13 and 14. 



Table 12. Probe Name Ag2003 

Primers | Sequences | TM | Length | Start Position | SEQ ID NO: 
Forward | 5'-CAGCCTAATGCTGAAACCTTCT-3' | 59.9 | 22 | 1117 | 102 
Probe | TET-5'-ATCCTCAGTTCCGTTTAACGCTGCTG-3'-TAMRA | 69.2 | 26 | 1145 | 103 
Reverse | 5'-ATCCTCGTCATCCTCCTCAT-3' | 58.5 | 20 | 1178 | 104 
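
The lengths listed for the Ag2003 primer-probe set can be checked directly against the sequences. The sketch below is illustrative only: the helper names are hypothetical, the sequences and lengths are copied from Table 12, and GC content is reported merely as a convenience (the TM values in the table come from the source and are not recomputed here, since the method used to derive them is not stated).

```python
# Sequences and stated lengths copied from Table 12 (Ag2003).
PRIMERS = {
    "forward": ("CAGCCTAATGCTGAAACCTTCT", 22),
    "probe": ("ATCCTCAGTTCCGTTTAACGCTGCTG", 26),
    "reverse": ("ATCCTCGTCATCCTCCTCAT", 20),
}

# Translation table mapping each base to its Watson-Crick complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in an uppercase DNA sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    for name, (seq, stated_length) in PRIMERS.items():
        status = "ok" if len(seq) == stated_length else "MISMATCH"
        print(f"{name}: length {len(seq)} (table: {stated_length}) {status}, "
              f"GC {gc_content(seq):.1f}%")
    # The reverse primer anneals to the sense strand, so its reverse
    # complement is the stretch that would appear in the sense sequence.
    print(reverse_complement(PRIMERS["reverse"][0]))
```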

Table 13. Panel 1.3D 

Tissue Name | Relative Expression(%) 1.3dx4tm5423t_ag2003_b2 | Tissue Name | Relative Expression(%) 1.3dx4tm5423t_ag2003_b2 
Liver adenocarcinoma | 0.0 | Kidney (fetal) | 0.0 
Pancreas | 0.0 | Renal ca. 786-0 | 0.0 
Pancreatic ca. CAPAN 2 | 0.0 | Renal ca. A498 | 0.0 
Adrenal gland | 0.0 | Renal ca. RXF 393 | 0.0 
Thyroid | 0.7 | Renal ca. ACHN | 0.0 
Salivary gland | 0.0 | Renal ca. UO-31 | 0.0 
Pituitary gland | 0.0 | Renal ca. TK-10 | 9.2 
Brain (fetal) | 0.0 | Liver | 100.0 
Brain (whole) | 2.0 | Liver (fetal) | 34.5 
Brain (amygdala) | 0.0 | Liver ca. (hepatoblast) HepG2 | 33.7 
Brain (cerebellum) | 0.0 | Lung | [illegible] 
Brain (hippocampus) | 0.0 | Lung (fetal) | [illegible] 
Brain (substantia nigra) | 3.0 | Lung ca. (small cell) LX-1 | [illegible] 
Brain (thalamus) | 0.0 | Lung ca. (small cell) NCI-H69 | 0.0 
Cerebral Cortex | 0.0 | Lung ca. (s.cell var.) SHP-77 | 0.0 
Spinal cord | 0.0 | Lung ca. (large cell) NCI-H460 | 0.0 
CNS ca. (glio/astro) U87-MG | 0.0 | Lung ca. (non-sm. cell) A549 | 0.0 
CNS ca. (glio/astro) U-118-MG | 0.0 | Lung ca. (non-s.cell) NCI-H23 | 0.0 
CNS ca. (astro) SW1783 | 0.0 | Lung ca. (non-s.cell) HOP-62 | 0.0 
CNS ca.* (neuro; met) SK-N-AS | 0.0 | Lung ca. (non-s.cl) NCI-H522 | 0.0 
CNS ca. (astro) SF-539 | 0.0 | Lung ca. (squam.) SW 900 | 0.0 
CNS ca. (astro) SNB-75 | 0.0 | Lung ca. (squam.) NCI-H596 | 0.0 
CNS ca. (glio) SNB-19 | 0.0 | Mammary gland | 0.0 
CNS ca. (glio) U251 | 0.0 | Breast ca.* (pl. effusion) MCF-7 | 7.5 
CNS ca. (glio) SF-295 | 0.0 | Breast ca.* (pl.ef) MDA-MB-231 | 0.0 
Heart (fetal) | 0.0 | Breast ca.* (pl. effusion) T47D | 0.0 
Heart | 6.3 | Breast ca. BT-549 | 0.0 
Fetal Skeletal | 0.0 | Breast ca. MDA-N | 0.0 
Skeletal muscle | 0.0 | Ovary | 0.0 
Bone marrow | 4.4 | Ovarian ca. OVCAR-3 | 0.6 
Thymus | 0.0 | Ovarian ca. OVCAR-4 | 0.0 
Spleen | 9.7 | Ovarian ca. OVCAR-5 | 0.0 
Lymph node | 0.0 | Ovarian ca. OVCAR-8 | 0.0 
Colorectal | 0.0 | Ovarian ca. IGROV-1 | 0.0 
Stomach | 0.0 | Ovarian ca.* (ascites) SK-OV-3 | 0.0 
Small intestine | 0.0 | Uterus | 5.1 
Colon ca. SW480 | 0.0 | Placenta | 7.1 
Colon ca.* (SW480 met) SW620 | 0.0 | Prostate | 0.0 
Colon ca. HT29 | 2.9 | Prostate ca.* (bone met) PC-3 | 0.0 
Colon ca. HCT-116 | 0.0 | Testis | 11.8 
Colon ca. CaCo-2 | 0.0 | Melanoma Hs688(A).T | 0.0 
83219 CC Well to Mod Diff (OD03866) | 22.9 | Melanoma* (met) Hs688(B).T | 0.0 
Colon ca. HCC-2998 | 1.4 | Melanoma UACC-62 | 0.0 
Gastric ca.* (liver met) NCI-N87 | 0.0 | Melanoma M14 | 0.0 
Bladder | 0.0 | Melanoma LOX IMVI | 0.0 
Trachea | 5.5 | Melanoma* (met) SK-MEL-5 | 0.0 
Kidney | 0.0 | Adipose | 0.6 


Table 14. Panel 4D 

Tissue Name | Relative Expression(%) 4dx4tm5530t_ag2003_b2 | Tissue Name | Relative Expression(%) 4dx4tm5530t_ag2003_b2 
93768_Secondary Th1_anti-CD28/anti-CD3 | 7.1 | 93100_HUVEC (Endothelial)_IL-1b | 0.0 
93769_Secondary Th2_anti-CD28/anti-CD3 | 0.0 | 93779_HUVEC (Endothelial)_IFN gamma | 0.0 
93770_Secondary Tr1_anti-CD28/anti-CD3 | 0.0 | 93102_HUVEC (Endothelial)_TNF alpha + IFN gamma | 0.0 
93573_Secondary Th1_resting day 4-6 in IL-2 | 0.0 | 93101_HUVEC (Endothelial)_TNF alpha + IL4 | 0.0 
93572_Secondary Th2_resting day 4-6 in IL-2 | 9.1 | 93781_HUVEC (Endothelial)_IL-11 | 0.0 
93571_Secondary Tr1_resting day 4-6 in IL-2 | 0.0 | 93583_Lung Microvascular Endothelial Cells_none | 0.0 
93568_primary Th1_anti-CD28/anti-CD3 | 0.0 | 93584_Lung Microvascular Endothelial Cells_TNFa (4 ng/ml) and IL1b (1 ng/ml) | 0.0 
93569_primary Th2_anti-CD28/anti-CD3 | 0.0 | 92662_Microvascular Dermal endothelium_none | 0.0 
93570_primary Tr1_anti-CD28/anti-CD3 | 0.0 | 92663_Microsvasular Dermal endothelium_TNFa (4 ng/ml) and IL1b (1 ng/ml) | 0.0 
93565_primary Th1_resting dy 4-6 in IL-2 | 17.0 | 93773_Bronchial epithelium_TNFa (4 ng/ml) and IL1b (1 ng/ml) ** | 98.9 
93566_primary Th2_resting dy 4-6 in IL-2 | 0.0 | 93347_Small Airway Epithelium_none | 0.0 
93567_primary Tr1_resting dy 4-6 in IL-2 | 0.0 | 93348_Small Airway Epithelium_TNFa (4 ng/ml) and IL1b (1 ng/ml) | 24.9 
93351_CD45RA CD4 lymphocyte_anti-CD28/anti-CD3 | 1.7 | 92668_Coronery Artery SMC_resting | 0.0 
93352_CD45RO CD4 lymphocyte_anti-CD28/anti-CD3 | 0.0 | 92669_Coronery Artery SMC_TNFa (4 ng/ml) and IL1b (1 ng/ml) | 0.0 
93251_CD8 Lymphocytes_anti-CD28/anti-CD3 | 0.0 | 93107_astrocytes_resting | 0.0 
93353_chronic CD8 Lymphocytes 2ry_resting dy 4-6 in IL-2 | 0.0 | 93108_astrocytes_TNFa (4 ng/ml) and IL1b (1 ng/ml) | 6.5 
93574_chronic CD8 Lymphocytes 2ry_activated CD3/CD28 | 0.0 | 92666_KU-812 (Basophil)_resting | 0.0 
93354_CD4_none | 0.0 | 92667_KU-812 | 0.0 
93252_Secondary Th1/Th2/Tr1_anti-CD95 CH11 | 4.0 | 93579_CCD1106 (Keratinocytes)_none | 5.9 
[illegible]_LAK cells_resting | 0.0 | 93580_CCD1106 (Keratinocytes)_TNFa and IFNg | 28.9 
[illegible]_LAK cells_IL-2 | 6.5 | 93791_Liver Cirrhosis | 100.0 
93787_LAK cells_IL-2+IL-12 | 0.0 | 93792_Lupus Kidney | 30.2 
93789_LAK cells_IL-2+IFN gamma | 0.0 | 93577_NCI-H292 | 0.0 
93790_LAK cells_IL-2+IL-18 | 0.0 | 93358_NCI-H292_IL-4 | 0.0 
93104_LAK cells_PMA/ionomycin and IL-18 | 0.0 | 93360_NCI-H292_IL-9 | 1.1 
93578_NK Cells IL-2_resting | 0.0 | 93359_NCI-H292_IL-13 | 0.0 
93109_Mixed Lymphocyte Reaction_Two Way MLR | 0.0 | 93357_NCI-H292_IFN gamma | 0.0 
93110_Mixed Lymphocyte Reaction_Two Way MLR | 0.0 | 93777_HPAEC_- | 0.0 
93111_Mixed Lymphocyte Reaction_Two Way MLR | 0.0 | 93778_HPAEC_IL-1 beta/TNA | 0.0 
93112_Mononuclear Cells (PBMCs)_resting | 0.0 | 93254_Normal Human Lung | 0.0 
93113_Mononuclear Cells (PBMCs)_PWM | 0.0 | 93253_Normal Human Lung Fibroblast_TNFa (4 ng/ml) and IL-1b (1 ng/ml) | 0.0 
93114_Mononuclear Cells (PBMCs)_PHA-L | 0.0 | [illegible]_Normal Human Lung Fibroblast_IL-4 | 0.0 
93249_Ramos (B cell)_none | 0.0 | [illegible]_Normal Human Lung Fibroblast_IL-9 | 0.0 
93250_Ramos (B cell)_ionomycin | 0.0 | [illegible]_Normal Human Lung Fibroblast_IL-13 | 0.0 
93349_B lymphocytes_PWM | 5.6 | 93258_Normal Human Lung Fibroblast_IFN gamma | 0.0 
93350_B lymphocytes_CD40L and IL-4 | 1.0 | 93106_Dermal Fibroblasts CCD1070_resting | 0.0 
92665_EOL-1 (Eosinophil)_dbcAMP differentiated | 0.0 | 93361_Dermal Fibroblasts CCD1070_TNF alpha 4 ng/ml | 0.0 
93248_EOL-1 (Eosinophil)_dbcAMP/PMAionomycin | 3.1 | [illegible]_Dermal Fibroblasts CCD1070_IL-1 beta 1 ng/ml | 0.0 
[illegible]_Dendritic Cells_none | 0.0 | 93772_dermal fibroblast_IFN gamma | 0.0 
93355_Dendritic Cells_LPS 100 ng/ml | 0.0 | 93771_dermal fibroblast_IL-4 | 0.0 
93775_Dendritic Cells_anti-CD40 | 0.0 | 93260_IBD Colitis 2 | 0.0 
[illegible]_Monocytes_resting | 6.3 | 93261_IBD Crohns | 0.0 
93776_Monocytes_LPS 50 ng/ml | 0.0 | 735010_Colon_normal | 14.7 
93581_Macrophages_resting | 0.0 | 735019_Lung_none | [illegible] 
93582_Macrophages_LPS 100 ng/ml | 0.0 | 64028-1_Thymus_none | 36.1 
93098_HUVEC (Endothelial)_none | 0.0 | 64030-1_Kidney_none | 0.0 
93099_HUVEC (Endothelial)_starved | 0.0 | | 










Panel 1.3D Summary Expression of the NOV1 gene appears to be specific to the 
liver, with the highest expression in normal liver tissue (CT=32.1), and significant expression 
detected in fetal liver and a liver cancer cell line as well. Since the expression of the NOV1 
gene appears to be associated with the liver, it could potentially be used to differentiate 
between tissues derived from the liver and other tissues. Furthermore, therapeutic modulation 
of the NOV1 gene may be beneficial in the treatment of liver related disorders, such as liver 
cirrhosis. 
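
The relationship between a CT value and a relative expression percentage can be illustrated with a small calculation. This is a hedged sketch, not the procedure confirmed by this section: it assumes that the sample with the lowest CT in a panel is set to 100% and that each additional cycle corresponds to a two-fold lower template amount (ideal amplification efficiency). The function names are hypothetical.

```python
import math


def relative_expression(ct: float, ct_min: float) -> float:
    """Percent expression relative to the sample with the lowest CT.

    ASSUMPTION: ideal doubling per cycle, so a sample that crosses the
    threshold one cycle later has half the starting template.
    """
    return 100.0 * 2.0 ** (ct_min - ct)


def implied_delta_ct(percent: float) -> float:
    """Cycle difference from the top sample implied by a relative percentage."""
    if percent <= 0:
        raise ValueError("percent must be positive")
    return math.log2(100.0 / percent)


if __name__ == "__main__":
    # Normal liver is the 100% sample in Panel 1.3D (CT=32.1 in the text).
    print(relative_expression(32.1, 32.1))
    # Fetal liver is listed at 34.5% in Table 13; under the assumption
    # above, this corresponds to a CT about 1.5 cycles later than liver.
    print(implied_delta_ct(34.5))
```

Because the top CT values reported here are above 32, small cycle differences near the detection limit translate into large percentage differences, which is worth keeping in mind when reading the low-level values in the tables.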

Panel 4D Summary Expression of the NOV1 gene in this panel is restricted to a 
few samples, with highest expression detected in liver cirrhosis (CT=33.2). This result is in 
concordance with the liver specific expression seen in Panel 1.3D. Expression of the gene is 
also detected at low but significant levels in the thymus and TNF-alpha and IL-1beta treated 
bronchial epithelium. The protein encoded by the NOV1 gene has homology to 
calcium-activated neutral proteases (calpain). Calpains have been identified in the trachea and in the 
lung, and may be involved in tissue destruction. Therapeutic drugs designed with the protein 
encoded for by the NOV1 gene may be important for the treatment of asthma, emphysema, 
and liver cirrhosis (Dear et al., A new subfamily of vertebrate calpains lacking a 
calmodulin-like domain: implications for calpain regulation and evolution. Genomics. 45:175-84, 1997). 
NOV2: Epsin-like 

Expression of the NOV2 gene (also referred to as 21421174) was assessed using the 
primer-probe set Ag3088 described in Table BA. Results from RTQ-PCR runs are shown in 
Tables 15, 16, 17, 18 and 19. 

Table 15. Probe Name Ag3088 

Primers | Sequences | TM | Length | Start Position | SEQ ID NO: 
Forward | 5'-CACGTTTACAAGGCCATGAC-3' | 59 | 20 | 256 | 105 
Probe | FAM-5'-ATGGAGTACCTCATCAAGACCGGCTC-3'-TAMRA | 68.6 | 26 | 280 | 106 
Reverse | 5'-ATGTTCTCCTTGCACTGCTG-3' | 59 | 20 | 319 | 107 



Table 16. Panel 1.3D 

Tissue Name | Relative Expression(%) 1.3dx4tm5430f_ag3088_b1 | Tissue Name | Relative Expression(%) 1.3dx4tm5430f_ag3088_b1 
Liver adenocarcinoma | 37.2 | Kidney (fetal) | 11.7 
Pancreas | 19.0 | Renal ca. 786-0 | 11.6 

Table 17. Panel 2.2 





Relative 
Expression(% 




Relative 
Expressionr%) 


1 Tissue Name 


Z*zx4tnio40o 
I ag3088 bl 


Tissue Name 


2.2x4tm6408f 
ag3088 bl 


Normal Colon GENPAK 061003 


26.7 


83793 Kidnev NAT rOD04^4R^ 


94.6 


197759 Colon cancer CCiTiC)^{)f\A\ 


14.9 


98938 Kidney malignant cancer 
(OD06204B) 


12.2 


97760 Colon cancer NAT 
uOD06064^ 


14-7 


98939 Kidney normal adjacent 
tissue COD06204E) 


29.1 


97778 Colnn m-nrr^r f^^'^^f\A^ <a\ 




J5973 Kidnev Cancer rOD04450. 




16,3 


01) 


43.1 - 


97779 Colon cancer NAT 
(OD06159) 


25.3 


85974 Kidnev NAT rOD04450-0^ 


1 40.6 


l98Rfi1 r^nlAn r*a'n^«>T- ^/^T\A/Cin*7 i\A 

v»/uiuii cancer (v^jL^uOi&y /-U4 


11.5 


Kidney Cancer Qontech 8120613 


5.7 


pi862 Colon cancer NAT • 
(OD06297-015) 


15.8 


Kidney NAT aontech 8120614 


52.1 


83237 CC Gr.2 ascenH rnln« 

K \JLJKJj^A, 1 f 


9.0 


Kidney Cancer Clontech 9010320 


15.5 


83238 CC NAT ronn^09 1 ) 


10.8 


Kidney NAT Clontech 9010321 


22.4 


197766 Colon cancer metastasis 


5.8 


Kidney Cancer aontech 8120607 


83.0 


97767 Lung NAT (OD06104) 


17.5 


pCidney NAT aontech 8120608 


35.5 


87472 Colon TTiRts to limp 
|rOD04451-0n 

187473 Lime NAT ron04451-02^ 


23.2 

1 0 9 


Nomoal Uterus GENPAK 061018 


13.0 


(Normal Prostate Clontech A+ 
6546-1 r80904^8'i 

84140 Prostate Canner rOD0441 m 


22.4 


Uterus Cancer GENPAK 064011 
Normal Thyroid Clontech A+ 
6570-1 (7080817) 


I 12.3 

' 7.5 


84141 ProstatP MAT rOD044in) 


7.8 


Thyroid Cancer GENPAK 064010 
Thyroid Cancer INVTIROGEN 
A3U2152 


28.0 


jNormal Ovary Res. Gen. 

98863 Ovarian cancer (OD06283- 
03) 


Aft Q 


Thyroid NAT INVITROGEN 
A302153 j 


7.1 




^^o^nal Breast GENPAK 061019 


14.9 


p8865 Ovarian cancer 
NAT/fallopian tube (OD06283-07) 


7.6 { 


54877 Breast Cancer rOD04566^ 


14.5 ■ ■■ 


Ovarian Cancer GENPAK 06400S 


11.9 I 


Breast Cancer Res. Gen. 1 024 | 


jU. / 


97773 Ovarian cancer /'^n^/>l4^^ 

97775 Ovarian cancer NAT 
fOD06145'i 

98853,X)varian cancer (OD06455- 
03) 


11./ C 


55975 Breast Cancer ^0004590- 1 

>U 1 


60.4 


19.8 


5976 Breast CBncer MetR T 
OD0459Q^0.^^ • 1 


25.3 


8 

IH.Z (_ 


7070 Breast Cancer Metastasis T 
OD04655.05^ ' ) 




98854 Ovarian NAT (OD06455- 
07) Fallopian tube 


1.9 G 


rENPAK Breast Cancer 064006 


55.0 
20.4 


Normal Lung GENPAK 061010 


13.2 B 


reast Cancer Clontech Q 1 009 fi^ 


16.3 


^2337 Invasive poor difF. lung 
adeno (ODO4945-01 ' 


22.1 B 


reast NAT Clontech 9 1 00265 




92338 Lung NAT CODO4945-03) 


B 

13.5 A 


reast Cancer INVITROGEN | 
209073 . 1 


7.2 
7.7 


84136 Lune Malipriant Cancer 
OD03126^ 

84137 Luna NATYOnn^ii 9^ 


B] 

12.2 -A 
5,6 . 97 


reast NAT INVITROGEN 
2090734 . ' 1 

763 Breast cancCT fQD06083) |- 


"17.9 ■ 
29.5
90372 Lung Cancer (OD05014A)


15.3 ' 


97764 Breast cancer node 
metastasis (OD06083) 




90373 Lung NAT (OD05014B) 


19.8 


Normal Liver GENPAK 061009 


•50.3 


97761 Lung cancer (OD0608n 


21.2 


Liver Cancer Research Genetics 
RNA 1026 


27.7 


7 / / oz jjung cancer jn A i 
{OD06081) 


12.8 


Liver Cancer Research Genetics 
RNA 1025 


80.2 


85950 Lune Cancer fOD04237.or 


5.5 


Paired Liver Cancer Tissue 
Research Genetics RNA 6004-T 


, 51.3 


85970 Lune NAT rOD04237.02*^ 


23.6 


Paired Liver Tissue Research 
Genetics RNA 6004-N 


7.0 


o jzjd ^cuiar ivxei iviet to i^iver 
rODO4310> 


16.4 


Paired Liver Cancer Tissue 




83256 Liver NAT rOD043 10^ 


19.3 


Paired Liver Tissue Research 
Genetics RNA fiOOS-T<r 


inn n 


84139 Melanoma Mets to Lung 
fOD0432n 


21.4 


Liver Canrpr rrTTKTP att f\/^A(\r\'2 


A 

tyZA 


84138 Lune NAT rOD0432n 


7.0 


Nonnal Bladder GENPAK 061001 


19.8 ^ 


Normal Kidney GENPAK 061008 


12.8 


Bladder Cancer Research Genetics 
RNA 1023 


lO.b 


03 /oo xsjonev Ca. JNuclear sxade 2 
rOD04338) 


59.3 


Bladder Cancer INVITROGEN 
A302173 




83787 Kidnev NATrOD04338^ 


18.1 


Normal Stomach GENPAK 

061017 


98.0 


83788 Kidnev Ca Nuclear erade 
l/2fOD04339^ 


55.8 


Gastric Cancer Clontech 9060397 


13.5 


83789 Kidnev NAT ^OD04339^ 


26.5 


y^.t^x. LMumdcn v^ioniecn yuouoyo 


41.4 


83790 Kidnev Ca. Clear cell tvpe 
fOD04340'> 


13.5 


Gastric Cancer Clontech 9060395 


26.0 . . 


83791 Kidnev NAT rOD04340^ 


29.0 


NAT Stomach Clontech 9060394 




83792 Kidnev Ca. Nuclear pradft ^ 
rOD04348V 


12.1 


Gastric Cancer GENPAK 064005 


30.9 


Table 18. Panel 4D 


. - - 




Relative 
Expression(%) 




Relative . 
£xpression(%) 


Tissue Name 


4dx4tni5510f 

_ ag:3088 b2 


Tissue Name - 


4dx4tm5510f 
ag3088 b2 


93768_Secondary Thl anti- 
CD28/anti-CD3 . 


< 

8.7 


?3100_HUVEC (Endothelial) IL- 
Ib 


6.4 


93769 Secondary Th2 anti- ^" 
CD28/anti-CD3 


c 

7.7 ; 


)3779_HUVEC (Endothelial)_IFN 
5amma 


17.7 


93770 Secondary Trl anti- 
CD28/anti-CD3 


s 

( 

9.0 s 


>3102_HUVEC 

Endo&elial)_TNF alpha + IFN 

;amma 


12.1 


93573_Secondary Thl resting day 
4-6inIL-2 


. ■ 9 
5.8 f 


3101_HUVEC 

Endothelial) TNF alpha + IL4 


16.3 


93572_Secondary Th2_resting day 
4-6inIL-2 


9 

4.1 1 


3781_HUVEC (Endothelial) IL- 
1 


14.0 


93571_Secondary Trl^resting day 
4-6inIL-2 . 


9 

4.3 E 


3583_Limg Microvascular 
ndothelial Cells none 


1 D >l 

lo.4 


93568_priniary Thl anti- 
CD28/anti-CD3 


9 

'* E 
4.3. a] 


3584;^Lung Microvascular 
ndothelial Cells TNFa(4ng/ml) 
ad ILlb fine/ml) 


12.4 ; 


93569_primaryTh2 anti- • " 
CD28/anti-CD3 . 


. 9 
■ 8.4. - 6] 


2662_Microvascuiar Dermal 
idothelium none 


23.4
93570_primary Tr1 anti-CD28/anti-CD3


11.7 


92663_Microsvasiilar Dermal 
endothehmn TNFa (4 ng/ml) and 
ILlb f 1 nR/ml) 


13.0 


93565_priniary Thl^resting dy 4-( 
inIL-2 


21.0 


93773_Bronchial 

epithehimi TNFa (4 ng/ml) and 

ILlb (1 ng/ml)** 


11.9 


93566_pnniary Th2_restiiig dy 4-( 
in IL-2 - 


10.3 


93347_,Small Airway 
Epithelixmi none 


14.4 


93567jpriniary Trl_resting dy 4-6 
mIL-2 


7.3 


93348_Small Airway 
EpitheHum TNFa (4 ng/ml) and 
ILlb (1 ng/ml) 


44.3 


93351_CD45RACD4 
lymphocyte anti-CD28/anti-CD3 


12.7 


92668_Coronery Artery 
SMC resting 


24.4 


93352_CD45RO CD4 
lymphocyte aiiti-CD28/aiiti-CD3 


10.3 


92669_Coroneiy Artery 
SMC^TNFa (4 ng/ml) and ILlb (1 
ng/ml) 


12.3 


93251_CD8 Lyn^Dhocj^es anti- 
CD28/anti-CD3 


7.2 


93107 astrocytes resting 


19.9 


93353_chromc CD8 Lymphoc3^es 
2ry restmg dv 4-6 in IL-2 


9.4 


93108_astrocytes TNFa (4 ng/ml) 
and ILlb (1 ng/ml) 


36.7 ' 


93574_chronic CDS Lyrophocytes 
2ry activated CD3/CD28 


8.5 


92666 KU-812 (Basophil) resting 


15.8 


93354 Cb4 none 


3.1 


92667_KU-812 
(Basophil) PMA/ionoycin 


27.8 * 


93252_Secondary 

xsix/ iiLc/ iTi anu-^L/ifD Cflll 1 


8.1 


93579_CCD1106 
(Keratinocytes) none 


14.8 


93103 LAK cells resting 


4.9 


J OU^t^CrJL/ 1 1 Oo 

(Keratinocytes) TNFa and IFNg 
** 


49.4 


93788 LAK ceUs IL-2 


11.3 


^Di^i. j-(iver \-4rmosis 


11.6 


93787 LAK cells IL-2+IL-12 


14.4 


93792 Lupus Kidney 


11.6 


gamma 


12.8 




■ 33.5 


93790 LAK cells IL-2+ IL-18 


9.0 


93358 NCI.H292 IL.4 


68.3 


93104 LJkK 

cells PMA/ionomycinandIL-18 


3.2 


93360 NC3.H292 IL^9 


41.9 


93578 NK Cells IL-2 restinp 


4.7 


93359 Na-H292 IL^13 


27.9 


93 1 09_Mixed Lyirphocyte 
Reaction Two Way MLR 


7.1 .[ 


?3357 Na-H292 IFM paTnm^i . 


21.6 


93 1 1 0_Mixed Lymphocyte 
Reaction Two Way MLR 


6.0 < 


)3777 HPAEC - 


13.8 


93 1 1 l_Mixed Lymphocyte 
Reaction Two Way MLR 


c 

4.9 a 


?3778_HPAEC_IL.l beta/TNA • 
ilpha 


16 4 


93 1 12_Mononuclear Cells 
(PBMCs) resting 


c 

5.0 I 


>3254_Noimal Huinan Lung . 
fibroblast none 


* vr."T 

38.7 


931 13_Mononuclear Cells 
(PBMCs) PWM 


9 
F 

5.8 1 


3253_Noimal Himaan Lung 
ibroblast TNFa (4 ng/ml) and IL- 
b (1 ng/ml) 


46.6 


93 114_Mononuclear Cells • " 
PBMCs) PHA-L 


9 

4.1 F 


3257_Normal Human Lung 
ibroblast IL-4 


34.6 


93249 Ramos (B cell) none 


. 9 
23.0 F 


3256_Nonnal Human Lung 
ibroblast IL-9 


20.3 


93250 Ramos (B cell) ionomycin 


9 

16 8 V 


3255_Normal Human Limg * 
loroDiast iL-13 


19.8 • 


93349 B lymphocytes PWM 


9, 

7.4 F 


3258_Normal Human Lung 
Ibroblast IFN gamma 


32.4 


93350_B lymgphojrtes CD40Land " 
IL-4 - - , . 


9: 

.5.4 C 


5 1 06_Dermal Fibroblasts 
CD1070 resting 


48.1
92665_EOL-1

(Eosinophil)_dbcAMP 

differentiated 


12.2 


93361_Dermal Fibroblasts 
CCD1070 TNF alpha 4 ng/ml 


39.6 


93248_EOL-l 

(Eosinophil)_dbcAMP/PMAionom 
ycin 


10.6 


93 1 05_Dermal Fibroblasts 
CCD1070 JL-l beta 1 ng/ml 


23.7 


93356 Dendritic Cells none 


inn 


93772_dernial fibroblast^IFN 
gamma 


11.6 


93355_Dendritic CeUs^LPS 100 
ng/ml 


8.8 


93771 dermal fibroblast IL-4 


23.5 


93775 Dendritic CeUs anti-CD40 


11.9 


93260 IBD Colitis 2 . 


5.0 


93774 Monocytes resting 




y32ol IBD Crohns 


12.2 


93776 Monocytes LPS 50 ng/ml 


8.5 


735010 Colon nonnal 


100.0 


93581 Macrophages resting 


11.6 


735019 Lung none 


11.4 


93582_Macrophages_LPS 100 


7.4 


64028-1 Thymus none 


25.3 


93098_HUVEC 
(Endothelial) none 


' 43.0 


54030-1 Kidney none 


8.4 


93099 JEIUVEC 
(Endolhelial) starved 


43.9 







Table 19. Panel CNS_neurodegeneration_v1.0

Tissue Name | Relative Expression(%) tm7048f_ag3088_a2s1 | Tissue Name | Relative Expression(%) tm7048f_ag3088_a2s1
AD 1 Hippo | 19.7 | Control (Path) 3 Temporal Ctx | 14.0
AD 2 Hippo | 35.6 | Control (Path) 4 Temporal Ctx | 44.4
AD 3 Hippo | 17.9 | AD 1 Occipital Ctx | 27.8
AD 4 Hippo | 17.4 | AD 2 Occipital Ctx (Missing) | 0.0
AD 5 Hippo | 100.0 | AD 3 Occipital Ctx | 15.6
AD 6 Hippo | 61.7 | AD 4 Occipital Ctx | 92.3
Control 2 Hippo | 58.0 | AD 5 Occipital Ctx | 70.8
Control 4 Hippo | 15.2 | AD 6 Occipital Ctx | 25.9
Control (Path) 3 Hippo | 13.4 | Control 1 Occipital Ctx | 8.7
AD 1 Temporal Ctx | 29.1 | Control 2 Occipital Ctx | 77.7
AD 2 Temporal Ctx | 47.3 | Control 3 Occipital Ctx | 29.6
AD 3 Temporal Ctx | 14.7 | Control 4 Occipital Ctx | 9.8
AD 4 Temporal Ctx | 34.1 | Control (Path) 1 Occipital Ctx | 69.0
AD 5 Inf Temporal Ctx | 84.2 | Control (Path) 2 Occipital Ctx | 16.7
AD 5 Sup Temporal Ctx | 47.4 | Control (Path) 3 Occipital Ctx | 7.0
AD 6 Inf Temporal Ctx | 65.5 | Control (Path) 4 Occipital Ctx | 24.5
AD 6 Sup Temporal Ctx | 60.2 | Control 1 Parietal Ctx | 13.8
Control 1 Temporal Ctx | 10.0 | Control 2 Parietal Ctx | 66.5
Control 2 Temporal Ctx | 69.1 | Control 3 Parietal Ctx | 19.8
Control 3 Temporal Ctx | 33.8 | Control (Path) 1 Parietal Ctx | 63.4
Control 4 Temporal Ctx | 17.8 | Control (Path) 2 Parietal Ctx | 33.1
Control (Path) 1 Temporal Ctx | 67.6 | Control (Path) 3 Parietal Ctx | 8.6
Control (Path) 2 Temporal Ctx | 52.5 | Control (Path) 4 Parietal Ctx | 59.2

Panel 1.3D Summary The NOV2 gene is widely expressed in many of the samples in
this panel, with highest expression in a brain cancer cell line (CT = 26). The NOV2 gene is
also highly expressed in all the normal tissues originating in the central nervous system,
including the amygdala, cerebellum, hippocampus, substantia nigra, thalamus, cerebral cortex
and spinal cord. The protein encoded by the NOV2 gene is a homolog of epsin, which is
involved in the phagocytosis of macromolecules, and interacts with Huntingtin-interacting
protein. Therefore, this gene may play a critical role in the endocytosis of Huntingtin protein
and the etiology of Huntington's disease. Downregulation of this gene or its protein product
may be of therapeutic benefit in the treatment of Huntington's disease.

The NOV2 gene is also expressed in many tissues with metabolic function, including
adipose, the pancreas, the adrenal, thyroid, and pituitary glands, and skeletal muscle, heart and
liver from both fetal and adult sources. Thus, this gene product may be important in the
pathogenesis and/or treatment of disease in any or all of these tissues, including obesity and
diabetes.

The NOV2 gene is highly expressed in renal, breast, brain, ovarian, lung, colon,
kidney, pancreatic and prostate cancer cell lines, when compared to normal kidney, breast,
ovary, and prostate tissues, and thus may play a role in cancer of these tissues. The gene may
also play a role in metastasis of melanoma as one cell line expresses this gene at a higher level
compared to other melanoma cell lines. Based on this expression profile, the expression of the
NOV2 gene could be of use as a marker for different grades/types of these cancers.
Furthermore, since this gene is expressed in multiple fetal tissues and cancer cell lines,

Panel 2.2 Summary Highest expression of the NOV2 gene is detected in liver tissue
adjacent to a liver tumor (CT = 27.3). In addition, the level of expression in some lung, breast,
liver and kidney cancer tissue samples appears to be increased when compared to the matched
normal tissue. The reverse appears to be true for colon, ovary and stomach tissue, where
expression is slightly higher in normal tissue than the matched cancer tissues. Thus, based
upon its profile, the expression of the NOV2 gene could be of use as a marker for
distinguishing some cancers from the normal adjacent tissue or as a marker for different
grades/types of cancer.
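
The matched-tissue comparison described above can be illustrated with a short Python sketch that computes a tumor-to-normal ratio from relative expression values. The two pairs below are read from the Panel 2.2 table; treating Clontech samples 8120607 and 8120608 as a matched pair is an assumption based on their consecutive identifiers, and the function and variable names are hypothetical.

```python
def tumor_to_normal_ratio(tumor_pct: float, normal_pct: float) -> float:
    """Ratio of relative expression (%) in tumor tissue to that in matched normal tissue."""
    if normal_pct <= 0:
        raise ValueError("normal expression must be positive to form a ratio")
    return tumor_pct / normal_pct


# (tumor %, normal adjacent %) as listed in the Panel 2.2 table.
PAIRS = {
    "Liver (Research Genetics RNA 6004-T vs 6004-N)": (51.3, 7.0),
    "Kidney (Clontech 8120607 vs 8120608, pairing assumed)": (83.0, 35.5),
}

if __name__ == "__main__":
    for label, (tumor, normal) in PAIRS.items():
        print(f"{label}: {tumor_to_normal_ratio(tumor, normal):.1f}-fold")
```

A ratio above 1 indicates higher expression in the tumor sample; whether a given ratio is meaningful depends on assay variability, which the table does not report.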

Panel 4D Summary The NOV2 gene is most highly expressed in colon (CT=22).
Significant expression is also detected in a variety of tissues including fibroblasts, endothelial
and epithelial cells, keratinocytes, leukocytes and smooth muscle cells. The protein encoded
by the NOV2 gene is a homolog of an EH-domain binding like protein, epsin, thought to be
involved in endocytosis. Members of the epsin family have been shown to play an important
role in wound healing. Since the NOV2 gene is expressed in several cell types, therapeutics
designed with the protein encoded for by this gene may serve important roles in regulating the
cellular uptake of bio-therapeutic molecules in general, and specifically in enhancing wound
healing.

Panel CNS_neurodegeneration_v1.0 Summary Highest expression of the NOV2
gene is detected in the hippocampus of a patient with Alzheimer's disease (CT=25.6).
However, there is also widespread expression in all the samples in this panel and no specific
association between the expression of this gene and the presence of Alzheimer's disease is
observed from these results. These results do however confirm expression of the NOV2 gene
in the brains of an additional set of individuals. Please see Panel 1.3D for a discussion of
potential utility of this gene in the central nervous system (Rosenthal et al., The epsins define a
family of proteins that interact with components of the clathrin coat and contain a new protein
module. J Biol Chem. 274:33959-65, 1999; Mishra et al., Clathrin- and AP-2-binding sites in
HIP1 uncover a general assembly role for endocytic accessory proteins. J Biol Chem. 2001;
Spradling et al., Epsin 3 is a novel extracellular matrix-induced transcript specific to wounded
epithelia. J Biol Chem. 276:29257-67, 2001).

NOV3: Low Density Lipoprotein B-like

Expression of the NOV3 gene (also referred to as AC025263_dal) was assessed using
the primer-probe sets Ag2002 and Ag2452 described in Tables 20 and 21. Results from RTQ-PCR
runs are shown in Tables 22, 23, 24 and 25.



Table 20. Probe Name Ag2002

Primers | Sequences | TM | Length | Start Position | SEQ ID NO:
Forward | 5'-GCCAGAAAGGCAACTATTCAG-3' | 59 | 21 | 727 | 108
Probe | FAM-5'-AAGTTCTCAACCAGCCACACCATGGT-3'-TAMRA | 69.7 | 26 | 749 | 109
Reverse | 5'-AGCAACTCCACTAATQAGCAAA-3' | 59 | 22 | 794 | 110














Table 21. Probe Name Ag2452

Primers | Sequences | TM | Length | Start Position | SEQ ID NO:
Forward | 5'-AGCAGTGCAGTTGTGAAAGTTT-3' | 59.1 | 22 | 2053 | 111
Probe | TET-5'-TGATTCATGGATTCACCCAGTCATTA-3'-TAMRA | 65.5 | 26 | 2075 | 112
Reverse | 5'-CAGAACTGAGCCAGCATCAT-3' | 59 | 20 | 2106 | 113







Table 22. Panel 1.3D

Tissue Name | Relative Expression(%) 1.3Dtm3824t_ag2452 | Relative Expression(%) ag2002
Liver adenocarcinoma | 6.1 | 15.9
Pancreas | 3.1 | 5.0
Pancreatic ca. CAPAN 2 | 1.7 | 3.8
Adrenal gland | 7.7 | 12.7
Thyroid | 6.7 | 13.6
Salivary gland | 4.9 | 7.3
Pituitary gland | 24.8 | 23.8
Brain (fetal) | 8.9 | 8.5
Brain (whole) | 18.9 | 33.9
Brain (amygdala) | 28.9 | 19.6
Brain (cerebellum) | 9.1 | 8.5
Brain (hippocampus) | 100.0 | 48.6
Brain (substantia nigra) | 4.3 | 5.3
Brain (thalamus) | 13.1 | 15.4
Cerebral Cortex | 41.2 | 100.0
Spinal cord | 5.3 | 8.5
CNS ca. (glio/astro) U87-MG | 5.3 | 14.4
CNS ca. (glio/astro) U-118-MG | 20.0 | 39.5
CNS ca. (astro) SW1783 | 10.3 | 25.5
CNS ca.* (neuro; met) SK-N-AS | 34.6 | 36.9
CNS ca. (astro) SF-539 | 3.8 | 12.0
CNS ca. (astro) SNB-75 | 6.7 | 33.7
CNS ca. (glio) SNB-19 | 3.9 | 16.0
CNS ca. (glio) U251 | 3.9 | 0.0
CNS ca. (glio) SF-295 | 8.2 | 28.9
Heart (fetal) | 9.6 | 55.1
Heart | 2.7 | 7.2
Fetal Skeletal | 24.8 | 84.1
Skeletal muscle | 3.8 | 8.4
Bone marrow | 4.5 | 3.1
Thymus | 3.3 | 7.5
Spleen | 9.0 | 12.7
Lymph node | 4.1 | 12.9
Colorectal | 5.9 | 18.7
Stomach | 5.2 | 17.7
Small intestine | 10.4 | 10.4
Colon ca. SW480 | 7.8 | 34.2
Colon ca.* (SW480 met) SW620 | 6.0 | 17.1
Colon ca. HT29 | 3.8 | 10.2
Colon ca. HCT-116 | 5.8 | 9.2
Colon ca. CaCo-2 | 5.4 | 22.2
83219 CC Well to Mod Diff (OD03866) | 5.3 | 15.7
Colon ca. HCC-2998 | 10.7 | 14.9
Gastric ca.* (liver met) NCI-N87 | 10.7 |
Bladder | 3.8 | 5.3
Trachea | 13.2 | 14.0
Kidney | 2.3 | 3.3
Kidney (fetal) | 5.9 | 9.7
Renal ca. 786-0 | 2.7 | 6.8
Renal ca. A498 | 14.8 | 34.9
Renal ca. RXF 393 | 1.3 | 6.9
Renal ca. ACHN | 1.4 | 24.5
Renal ca. UO-31 | 3.7 | 15.5
Renal ca. TK-10 | 4.6 | 14.9
Liver | 2.9 | 2.8
Liver (fetal) | [illegible] | 7.9
Liver ca. (hepatoblast) HepG2 | 5.8 | 28.1
Lung | 11.7 | 7.5
Lung (fetal) | 7.6 | 14.6
Lung ca. (small cell) LX-1 | [illegible] | 16.6
Lung ca. (small cell) NCI-H69 | 14.7 | 36.1
Lung ca. (s.cell var.) SHP-77 | 15.6 | 30.6
Lung ca. (large cell) NCI-H460 | 2.2 | 4.5
Lung ca. (non-sm. cell) A549 | 8.2 | 12.0
Lung ca. (non-s.cell) NCI-H23 | 3.8 | 15.4
Lung ca. (non-s.cell) HOP-62 | 5.1 | 21.8
Lung ca. (non-s.cl) NCI-H522 | 5.5 | 18.3
Lung ca. (squam.) SW 900 | 4.0 | 9.8
Lung ca. (squam.) NCI-H596 | 3.1 | 14.7
Mammary gland | 11.2 | 27.5
Breast ca.* (pl. effusion) MCF-7 | 7.3 | 23.7
Breast ca.* (pl.ef) MDA-MB-231 | 23.7 | 39.8
Breast ca.* (pl. effusion) T47D | 8.4 | 37.1
Breast ca. BT-549 | 11.0 | 16.4
Breast ca. MDA-N | 8.7 | 20.6
Ovary | 17.6 | 52.5
Ovarian ca. OVCAR-3 | 4.9 | 19.9
Ovarian ca. OVCAR-4 | 0.9 | 3.3
Ovarian ca. OVCAR-5 | 7.0 | 32.5
Ovarian ca. OVCAR-8 | 5.4 | 14.4
Ovarian ca. IGROV-1 | 1.9 | 3.8
Ovarian ca.* (ascites) SK-OV-3 | 4.4 | 12.0
Uterus | 7.6 | 14.2
Placenta | 7.9 | 13.2
Prostate | 6.0 | 6.8
Prostate ca.* (bone met) PC-3 | 8.1 | 18.4
Testis | 10.6 | 19.6
Melanoma Hs688(A).T | 3.7 | 28.9
Melanoma* (met) Hs688(B).T | 2.3 | 45.7
Melanoma UACC-62 | 1.1 | 3.3
Melanoma M14 | 1.2 | 3.5
Melanoma LOX IMVI | 9.1 | 6.7
Melanoma* (met) SK-MEL-5 | 12.9 | 13.7
Adipose | 2.6 | 4.6



Table 23. Panel 2D 





Relative 
£xpression(%) 




Relative 
Expression(% 

) 


2Dtin3825t_ 
ag2452 


Tissue Name 


2Dtm3825t a 
E2452 ^ 


Nonnal Colon GENPAK 061003 


100.0 


Kidney NAT Clontech 8120608 


24.5 


83219 CC WeU to Mod Diff 
{{JuUdooo) 


18.2 


Kidney Cancer Clontech 8120613 


51.1 


83220 CC NAT rOD03866^ 


19.1 


Kidney NAT Clontech 8120614 


29.1 


83221 CC Gr.2 rectosiemoid 
rOD03868^ 


8,0 


Kidney Cancer Clontech 9010320 


193 


83222 CC NAT rOD03868^ 


6.0 


Kidney NAT Clontech 9010321 


31.4 


83235 CC Mod Diff CODO3920'i 




JNormal uterus Cj£NPAK 061018 


6.8 


83236 CC NAT fODO3920^ 


24.0 


Uterus Cancer GENPAK 06401 1 


32.1 


/ CC Ur.2 ascend colon 

rOD0392n 


91.4 


Nomaal Thyroid Clontech A+ 
6570-1 


29.9 


83238 CC NAT rOD0392n 


19.8 


Thyroid Cancer GENPAK 064010 


28.3 


CC iTom j'artial 
Heoatectomv ( ODO4309^ 


66.4 


Thyroid Cancer INVITROGEN 
A302152 


17.1 


83242 Liver NAT rOD04369^ 


21.2 


Thyroid NAT INVITROGEN 
A302153 


29.9 


67472 Colon mets to lung 
rOD04451-0n 


24.3 


Nonnal Breast GENPAK 061019 


25.7 


87473 LuneNAT rOD04451-02) 


14.7 


84877 Breast Cancer rOD04566^ 


15.3 


Nonnal Prostate Clontech A+ 
6546-1 




55975 Breast Cancer rOn04500- 


' 76.8 


84140 Prostate Cancer rOD04410^ 


38.7 


85976 Breast Cancer Mete 
OD04590-0.^^ 


68.3 


84141 Prostate NAT rOD04410^ 


35.8 


57070 Breast Cancer Metastasis 
OD04655.OS^ 


77.9 


87073 Prostate Cancer rOD04720. 
01} 


52.5 


GENPAK Breast Cancer 064006 


14.2 


87074 Prostate NAT rOD0475n. 
021 


68.3 ] 


Breast Cancer Res. Gen. 1024 . 


24.3 • 


Normal Lung GENPAK 061010 


35.8 ] 


Breast Cancer Qontech 9100266 


68.3 


83239 Lune Met to Muscle 
rOD04286^ 


28.3 ] 


Breast NAT Clontech 9100265 


31.6 


83240 Muscle NAT rOD0498<^) 


1 

17.8 I 


Breast Cancer INVITROGEN 
\209073 


35.4 


84136 Lune Malignant Cancer 
rOD03126) . - 


I 

35.6 i 


3reast NAT INVITROGEN 
U090734 ' 


22.8 


84137 Lune NAT rODO^I^YTk 


' 45.1 I 


formal Liver GENPAK 061009 


9.6 . 


84871 Lunc Cancer ror>044n4^ > 


17.9 I 


jver Cancer GENPAK 06400'^ 


9.3
84872 Lung NAT (OD04404)


18.0 


Liver Cancer Research Genetics 
RNA 1025' ■ ^ * -* ' 


-9.7 • 


84875 Ltmff Cancer rOD64565^ 


6.9 


Liver Cancer Research Genetics 
KT^A 1026 


9.6 . 


<S4o /o i-nne NAT ( OD04565) 


8.2 


Paired Liver Cancer Tissue 
Research Genetics RNA 6004-T 


13.6 ' 


85950 Lune Cancer COD04237.0 11 


50.0 


Paired Liver Tissue Research 
Genetics RNA 6004-N 


18.8. 


85970 Lune NAT rOD04237-07;k 


16.8 


Paired Liver Cancer Tissue 
Research Genetics RNA 6005-T 


10.3 


o^^^j^ wcuiax iviei iviet to L,iver 
fODO4310^ 


19.9 


Paired Liver Tissue Research 
Genetics RNA 6005-N 


1.7 ' 


83256 Liver NAT rOD043 1 0^ 


18.6 


Normal Bladder GENPAK 061001 


53.2 1 


84139 Melanoma Mats to Lunp 
rOD0432n 


35.1 


Bladder Cancer Research Genetics 
RNA 1023 


37.1 


84138 Lune NAT rOD04.^2n 


35.1 


Bladder Cancer INVITROGEN 
A302173 


26.6 . 


Normal Kidney GENPAK 061008 
/ oo jsjonev *^a* iMuciear grade 2 


57.4 


87071 Bladder Cancer fOD04718- 

oa 

87072 Bladder Normal Adjacpnt 


46.3 


fOD04338^ 

83787 KidnevNAT ror)04^'^R^ 


58.2 
30.1 


rOD047 18-03^ 

Normal Ovaiy Res. Gen. 


24.8 

41.8 1 


83788 Kidnev Ca Nuclear pr«H« 
1/2 rOD04339^ 


26.4 


Ovarian Cancer GENPAK 064008 


54.0 


oj> /oy jsjcinev JNAT iOD04339i 


35.4 


87492 Ovarv Cancer rnnfU7#;R. 
071 


76.8 ' 


83790 Kidnev Ca. Clear cell typp, 
rOD04340^ 


38.7 


87493 Ovarv NAT rOD0476R-n«^ 


1A C ' 1 


83791 Kidnev NAT rOD0434m 


28.7 ( 


Normal Stomach GENPAK 
D61017 


33.0 


83792 Kidnev Ca. Nnrlear erade 3 
UU0434S) 

83793 KidnevNAT rnD04^4R) 


18.3 ( 


jastric Cancer Clontech 9060358 


11.5 


87474 Kidnev Cancer rOD04622. 

an . - 

87475 KidnevNAT rOD04622-03t 


25.7 1 

18.4 ( 

7.0 ^ 


^AT Stomach Clontech 9060359 
jastric Cancer Clontech 9060395 


28.5 1 
35.1 


i55973 Kidnev Cancer rOD04450- 
Qll 

85974 Kidnev NAT rOT>04450-03^ 


25.7 C 


^AT Stomach Clontech 9060394 
jastric Cancer Clontech 9060397 


40.3^ 1 
71.7 


Kidney Cancer Clontech 8120607 


24.1 ^ 

13.2 C 


TAT Stomach Clontech 9060396 
Gastric Cancer GENPAK 064005 


1Z2 n . 
35.1 -1 



Table 24. Panel 4D

Tissue Name | Relative Expression(%) 4dx4tm5532f_ag2002_a2 | Relative Expression(%) 4Dtm3826t_ag2452
93768 Secondary Th1 anti-CD28/anti-CD3 | 25.4 | 27.4
93769 Secondary Th2 anti-CD28/anti-CD3 | 35.8 | 16.7
93770 Secondary Tr1 anti-CD28/anti-CD3 | 36.9 | 46.3
93573 Secondary Th1 resting day 4-6 in IL-2 | 25.8 | 13.4
93572 Secondary Th2 resting day 4-6 in IL-2 | 18.9 | 17.8
93571 Secondary Tr1 resting day 4-6 in IL-2 | 23.0 | 17.0
93568 primary Th1 anti-CD28/anti-CD3 | 15.9 | 31.0

93569 primary Th2 anti-CD28/anti-CD3



[93570 primary Trl anti-CD28/anti-CD3 
)3565 primary Thl resting dv 4-6 in IL*2 



»3566 primary Th2- resting dy 4-6 in IL-2 



30.9 



30.2 



77.0 



33.8 



26.8 



35.8 



83.5 



39.2 



93567 primary Trl resting dy 4-6 in EL-2 



29.2 



23.2 



93351 CD45RA CD4 lymphocyte apti-CD28/anti-CD3 

93352 CD45RO 0)4 lymphocyte anti-CT)28/anti-CD3 



26.1 



37.1 



30.1 



34.9 



93251 CDS Lymphocytes "anti-CD28/aiiti-CP3 



21.8 



16.8 



93353_chronic CDS Lymphocytes 2ry resting dy 4-6 in IL-2 



26.8 



24.3 



93574 chronic CDS Lymphocytes 2ry activated CD3/CD28 



23.1 



28.9 



93354 CD4 none 



22.9 



19.6 



93252 Secondary Thl/Th2/Trl anti-CD95 CHll 
93103 LAK cells resting 



29.2 



16.9 



22.5 



21.5 



g3788 LAK. cells IL-2 



^3787 LAK cells IL-2+IL-12 



93789 LAK cells IL-2+IFN gamma 



33.2 



33.1 



35.0 



22.8 



18.4. 



37.9 



193790 LAK cells IL-2+IL-18 



30.1 



35.6 



93104 LAK cells PMA/ionomycin andIL-18 



93578 NK CeDs IL-2 resting 



93109 Mixed Lymphocyte Reaction Two Way MLR 



5.6 



24.2 



29.3 



6.0 



19.3 



28.9 



93110 Mixed Lymphocyte Reaction Two Way MLR 



22.3 



15.3 



93 1 1 1 Mixed Lymphocyte Reaction Two Way MLR 



21.6 



12.1 



93112 Mononuclear Cells CPBMCs) resting 



93113 Mononuclear Cells (PBMCs) PWM 



93114 Mononnclear Cells (PBMCs) PHA-L 



93249 Ramos (B cell) none 



14.6 



22.3 



11.5 



27.1 



12.4 



57.8 



28.7 



21.5 



93250 Ramos (B cell) ionomycin 



16.9 



66.9 



93349 B lymphocytes PWM 



23.6 



65.5 



93350 Blymphoytes CD40LandIL-4 



92665 EOL-1 (Eosinophil) dbcAMP diflFerentiated 
93248 EOI^l (Eosinophil) dbcAMP/PMAionomvcin 



23.6 



8.8 



83 



27.5 



6.0 



6.6 



|93356 Dendritic Cells none 



14.0 



10.1 



93355 Dendritic Cells LPS 100 ng/ml 



14.0 



10.4 



193775 Dendritic Cells anti-CD40 



19.4 



15.0 



93774 Monocjrtes resting 



25.3 



22.5 



93776 Monocytes LPS50ng^ 
93581 Macrophages, resting 



25.4 



21.6 



20.0 



24.8 



93582 Macrophages LPS 100 ng/ml 



16.5 



13.7 



93098 HUVEC (Endothelial) none 



93099 HUVEC (Endothelial) starved 



31.1 



45.2 



36.9 



55.9 



93100 HUVEC (Endothelial) n^lb 



93779 HUVEC (Endothelia l) TPNT gamma 



15.8 



37.6 



24.7 



45.4 



93 102 HUV EC (Endothelial) TNF alpha + IFN gamma 



29.3 



27.4 



93101 HUVE C (Endothelial) TNF alpha + m 



27.6 



20.4 



193781 HUVEC (Endothelial) IL-il 



13.9 



11.7
93583 Lung Microvascular Endothelial Cells none

93584_Lung Microvascular Endothelial Cells TNFa (4 ng/ml) and 
ILlbang/ml) ~ ■ . 



21.2 



26.4 



29.0 



92662 Microvascnlar Dermal endothelimn none 



92663_Microsyasular Dermal endoflielium_TNFa (4 ng/ml) andELlb 
(1 ng/ml) 



25.5 



25.7 



34.9 



36.3 



28.7 



93773 Bronchial epithelium TNFa (4 ng/ml) and ILlb fl ng/ml) ** 



22.9 



6.2 



93347 Small Airway Epithelium none 



18.8 



17.7 



93348 Small Airway Epitfaelium TNFa (4 ng/ml) and ILlb (1 ng/ml) 



44.4 



51.1 



92668 Coronery Artery SMC resting 



28.0 



45.4 



92669 Coronary Artery SMC TNFa (4 ng/ml) and IL1b (1 ng/ml) | 22.3 | 25.0
93107 astrocytes resting | 41.0 | 24.0
93108 astrocytes TNFa (4 ng/ml) and IL1b (1 ng/ml) | 58.4 | 17.4
92666 KU-812 (Basophil) resting | 28.6 | 31.0
92667 KU-812 (Basophil) PMA/ionomycin | 63.5 | 65.1
93579 CCD1106 (Keratinocytes) none | 17.0 | 18.9
93580 CCD1106 (Keratinocytes) TNFa and IFNg ** | 79.9 | 2.6
93791 Liver Cirrhosis | 16.7 | 3.3
93792 Lupus Kidney | 21.9 | 5.1
93577 NCI-H292 | 24.0 | 46.3
93358 NCI-H292 IL-4 | 24.3 | 42.6
93360 NCI-H292 IL-9 | 25.2 |
93359 NCI-H292 IL-13 | 11.7 |
93357 NCI-H292 IFN gamma | 14.7 | [illegible]
93777 HPAEC - | 23.1 | 25.7
93778 HPAEC IL-1 beta/TNA alpha | 39.3 | 44.4
93254 Normal Human Lung Fibroblast none | 40.6 | 26.2
93253 Normal Human Lung Fibroblast TNFa (4 ng/ml) and IL-1b (1 ng/ml) | 71.3 | 31.6
93257 Normal Human Lung Fibroblast IL-4 | 52.7 | 66.4
93256 Normal Human Lung Fibroblast IL-9 | 29.9 | 67.8
93255 Normal Human Lung Fibroblast IL-13 | 33.0 | 35.1
93258 Normal Human Lung Fibroblast IFN gamma | 45.0 | 77.4
93106 Dermal Fibroblasts CCD1070 resting | 56.3 | 83.5
93361 Dermal Fibroblasts CCD1070 TNF alpha 4 ng/ml | 84.6 | 100.0
93105 Dermal Fibroblasts CCD1070 IL-1 beta 1 ng/ml | 39.7 | 45.4
93772 dermal fibroblast IFN gamma | 15.5 | 19.5
93771 dermal fibroblast IL-4 | 29.2 | 42.3
93260 IBD Colitis 2 | 4.7 | 2.2
93261 IBD Crohns | 7.2 | 4.7
735010 Colon normal | 100.0 | 37.9
735019 Lung none | 15.0 | 26.6
64028-1 Thymus none | 39.9 | 55.1
64030-1 Kidney none | 35.3 | 67.8

Tissue Name | Relative Expression(%) tm6902t_ag2452_a2s2 | Tissue Name | Relative Expression(%) tm6902t_ag2452_a2s2
AD 1 Hippo | 8.0 | Control (Path) 3 Temporal Ctx | 4.2
AD 2 Hippo | 36.4 | Control (Path) 4 Temporal Ctx | 35.8
AD 3 Hippo | 2.9 | AD 1 Occipital Ctx | 5.2
AD 4 Hippo | 8.3 | AD 2 Occipital Ctx (Missing) | 0.0
AD 5 Hippo | 52.3 | AD 3 Occipital Ctx | 2.1
AD 6 Hippo | 47.1 | AD 4 Occipital Ctx | 28.1
Control 2 Hippo | 37.4 | AD 5 Occipital Ctx | 15.8
Control 4 Hippo | 7.7 | AD 6 Occipital Ctx | 49.3
Control (Path) 3 Hippo | 3.6 | Control 1 Occipital Ctx | 2.2
AD 1 Temporal Ctx | 7.9 | Control 2 Occipital Ctx | 65.3
AD 2 Temporal Ctx | 49.1 | Control 3 Occipital Ctx | 10.0
AD 3 Temporal Ctx | 4.1 | Control 4 Occipital Ctx | 5.3
AD 4 Temporal Ctx | 28.4 | Control (Path) 1 Occipital Ctx | 87.2
AD 5 Inf Temporal Ctx | 76.5 | Control (Path) 2 Occipital Ctx | 9.0
AD 5 Sup Temporal Ctx | 32.8 | Control (Path) 3 Occipital Ctx | 1.5
AD 6 Inf Temporal Ctx | 46.7 | Control (Path) 4 Occipital Ctx | 12.0
AD 6 Sup Temporal Ctx | 42.1 | Control 1 Parietal Ctx | 5.0
Control 1 Temporal Ctx | 3.7 | Control 2 Parietal Ctx | 28.9
Control 2 Temporal Ctx | 51.3 | Control 3 Parietal Ctx | 15.5
Control 3 Temporal Ctx | 14.0 | Control (Path) 1 Parietal Ctx | 87.4
Control 4 Temporal Ctx | 7.6 | Control (Path) 2 Parietal Ctx | 24.7
Control (Path) 1 Temporal Ctx | 100.0 | Control (Path) 3 Parietal Ctx | 1.3
Control (Path) 2 Temporal Ctx | 45.4 | Control (Path) 4 Parietal Ctx | 41.9



Panel 1.3D Summary Ag2002/Ag2452 Two experiments with two different 
probe/primer sets produce results that are in very good agreement, with hi^est e3q>ression in 
both runs occurring in regions of the brain. Expression of the NOV3 gene is highest in the 
cerebral cortex (CTs=26) in one run iand the hippocanq)us in the other (CT=27) with 
significant e>q)ression also detected in the amygdala. This expression pattern indicates a 
functional role for the NOV3 gene product in Alzheimer's disease (AD), since the gene, a low 
density lipoprotein homolog, is expressed in the regions of the brain important to AD 
pathology. Increased expression of apolipoprotein B in the serum of Alzheimer's disease, and 
evidence that LRP contributes to the pathogenesis of Alzheimer's disease suggest a' 
pathological role for the protein encoded by the N0V3 gene. Therefore, the AC024263_A 
gene product may be a promising antibody or small molecule target for the treatment of 

Alzheimer's disease. ■ , . 

High levels of expression are also detected in cell lines derived from brain cancer,
breast cancer, lung cancer, kidney cancer and melanoma. In addition, the expression in normal
ovary seems to be higher than in cell lines derived from ovarian cancer tissues. Thus, the
expression of this gene could be of use as a marker or as a therapeutic for these cancers.

The NOV3 gene is widely expressed in tissues with metabolic function and,
significantly, is expressed at higher levels in fetal skeletal muscle (CTs=27-30) than in adult
skeletal muscle (CTs=30-33). This difference in expression suggests that the NOV3 protein
product could be involved in muscular growth or development in the fetus and therefore could
act in a regenerative capacity in an adult. Thus, therapeutic modulation of the NOV3 gene
could be useful in the treatment of muscle related diseases, and treatment with the protein
product could restore muscle mass or function to weak or dystrophic muscle.

Panel 2D Summary Ag2452 Highest expression of the NOV3 gene occurs in colon
(CT=29.7). High levels of expression are also detectable in breast cancer, prostate cancer,
ovarian cancer, and colon cancer when compared to their normal adjacent tissue. Thus,
expression of the NOV3 gene could be used as a marker to detect the presence of these
cancers.

Panel 4D Summary Ag2002/Ag2452 Two experiments with two different probe and
primer sets show highest expression of the NOV3 gene in normal colon (CT=26.2) and dermal
fibroblasts treated with TNF-alpha (CT=29.2). Significant expression is also seen in
fibroblasts, endothelial and epithelial cells, keratinocytes, leukocytes, smooth muscle cells and
normal kidney. The NOV3 gene is expressed at much lower levels in colon from a patient with
inflammatory bowel disease (IBD) when compared to expression in normal colon. Similarly,
expression in lupus kidney is much lower than in normal kidney. Thus, the protein encoded by
the NOV3 gene may be involved in normal tissue/cellular functions and, at least in the kidney
and colon, downregulation of this protein may serve as a diagnostic marker for lupus or IBD.

Panel CNS_neurodegeneration_v1.0 Summary Ag2452 The NOV3 gene is
expressed in most of the samples in this panel, with highest expression detected in the temporal
cortex of a control patient (CT=29.4). While no clear disease association emerged for the gene
expression in this neurodegeneration panel, based on its homology to a low density lipoprotein
and its expression profile in Panel 1.3D, the NOV3 gene product remains a promising
antibody or small molecule target for the treatment of Alzheimer's disease (Caramelli et al.,
Increased apolipoprotein B serum concentration in Alzheimer's disease. Acta Neurol Scand.
100:61-3, 1999 and Ulery et al., Modulation of beta-amyloid precursor protein processing by
the low density lipoprotein receptor-related protein (LRP). Evidence that LRP contributes to
the pathogenesis of Alzheimer's disease. J Biol Chem 275(10):7410-5, 2000).

NOV4: Purinoceptor-like

Expression of NOV4 gene (also referred to as AC026756_dal) was assessed using the
primer-probe sets Ag1905 and Ag2504 described in Tables 26 and 27. Results from RTQ-PCR
runs are shown in Tables 28, 29, and 30.



Table 26. Probe Name Ag1905

Primers   Sequences                                        TM     Length   Start Position   SEQ ID NO:
Forward   5'-TGAGAATCAGATCCATGAAGCT-3'                     58.9   22       1174             114
Probe     TET-5'-CCATTAGCTGCTCTGAACACCTTTGG-3'-TAMRA       67.9   26       1211             115
Reverse   5'-GTCGCTGACCACCACATATAGT-3'                     59     22       1246             116



Table 27. Probe Name Ag2504

Primers   Sequences                                        TM     Length   Start Position   SEQ ID NO:
Forward   5'-CTGAGAGCGAGTTACTGCTCAT-3'                     58.9   22       272              117
Probe     TET-5'-TGATTCATATTGCCAAACTGAACTCTCTTG-3'-TAMRA   67.1   30       295              118
Reverse   5'-TGTCTCCTTTCATCTTGCAAGA-3'                     60     22       328              119
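
The tabulated lengths in Tables 26 and 27 can be cross-checked directly against the primer and probe sequences. The sketch below is illustrative only and is not part of the described method: the dictionary OLIGOS and the helper names are hypothetical, and GC content is computed simply as an additional sanity statistic. Melting temperatures are not recomputed, because the TM calculation method is not stated in this section.

```python
# Primer/probe sequences and tabulated lengths transcribed from Tables 26 and 27.
# OLIGOS, length_matches and gc_fraction are illustrative names, not from the source.
OLIGOS = {
    "Ag1905 forward": ("TGAGAATCAGATCCATGAAGCT", 22),
    "Ag1905 probe": ("CCATTAGCTGCTCTGAACACCTTTGG", 26),
    "Ag1905 reverse": ("GTCGCTGACCACCACATATAGT", 22),
    "Ag2504 forward": ("CTGAGAGCGAGTTACTGCTCAT", 22),
    "Ag2504 probe": ("TGATTCATATTGCCAAACTGAACTCTCTTG", 30),
    "Ag2504 reverse": ("TGTCTCCTTTCATCTTGCAAGA", 22),
}


def length_matches(oligos):
    """Return, per oligo, whether the sequence length equals the tabulated length."""
    return {name: len(seq) == expected for name, (seq, expected) in oligos.items()}


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    checks = length_matches(OLIGOS)
    for name, (seq, _) in OLIGOS.items():
        print(f"{name}: length ok = {checks[name]}, GC = {gc_fraction(seq):.2f}")
```

A mismatch reported by length_matches would point to a transcription error in either the sequence or the Length column.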



Table 28. Panel 1.3D



Tissue Name


Relative Expression(%)


1.3Dtm2783t_ag1905


1.3Dtm2834t_ag1905


Liver adenocarcinoma


0.0 


0.0 


Pancreas 


1.3 


3.2 


Pancreatic ca. CAPAN 2


0.0 


0.0 


Adrenal gland 


0.0 


0.5 


Thyroid 


1.9 


1.1 


Salivary gland 


2.1 


1.2 


Pituitary gland 


0.0 


0.5 


Brain (fetal) 


2.7 


1.3 


Brain (whole) 


7.5 


9.9 


Brain (amygdala)


4.2 


6.7 


Brain (cerebellum)


0.0


0.0 


Brain (hippocampus) 


4.5 


10.7 


Brain (substantia nigra)


0.7 


0.4 


Brain (thalamus) 


15.1 


9.2 


Cerebral Cortex 


14.2 


17.3 


Spinal cord 


4.8


1.0 


CNS ca. (glio/astro) U87-MG 


0.0 


0.0 


CNS ca. (glio/astro) U-118-MG


0.4 


0.9 


CNS ca. (astro) SW1783 


0.0 


0.4


CNS ca.* (neuro; met) SK-N-AS


3.4


1.4 


CNS ca. (astro) SF-539 


0.0 


0.0


CNS ca. (astro) SNB-75 


0.0 


0.0 


CNS ca. (glio) SNB-19 


0.0


0.0


CNS ca. (glio) U251


0.0 


0.0 


CNS ca. (glio) SF-295


0.0 


0.0 


Heart (fetal) 


0.0 


0.0 


Heart 


0.5 


0.4 


Fetal Skeletal 


2.5 


3.7 


Skeletal muscle 


0.0 


0.0 


Bone marrow 


0.4 


0.0 


Thymus 


0.0 


0.0 


Spleen 


0.9 


1.6 


Lymph node 


0.6 


1.2 


Colorectal 


3.5 


4.4 


Stomach 


1.5 


i.r 


Small intestine 


0.3 


1.3


Colon ca. SW480 


15.2 


18.8


Colon ca.* (SW480 met)SW620 




o.o 


Colon ca.HT29 


0.0 


0 0 


Colon ca. HCT-116


0.0 




Colon ca. CaCo-2 


0.0 


J. m\J 


83219 CC Well to Mod Diff (OD03866)


30.1 




Colon ca.HCC-2998 


1.0 




Gastric ca.* (liver met) NCI-N87 


0.9 




Bladder 


0.0 


ft ft 


Trachea 


100.0 


Ol, 1 . 


Kidney 


5.3 




Kidney (fetal) 


1.7 


1 O ^ 


Renal ca. 786-0 


0.0 


ft ft 


Renal ca. A498 


0.0 


ft ft 


Renal ca. RXF 393 


0.0 


ft ft 


Renal ca. ACHN


0.0 


ft ft ' 


Renal ca.UO-31 


0.0 


ft ft 


Renal ca. TK-10 


0.0 


ft ft 


Liver 


0.0 


-ft ft • 


Liver (fetal) - - 


0.0 


ft ft * 


Liver ca. (hepatoblast) HepG2 


0.0 


ft ft 




19 


1.1 


Lung (fetal) 


3.3 


o.o 


Lung ca. (small cell) LX-1 


3.1 


2.1 


Lung ca. (small cell) NCI-H69 


. LO 


0.3 


Lung ca. (s.cell var.) SHP-77


84.1


100.0 


Lung ca. (large cell)NCI-H460 


0.0 


0.0 


Lung ca. (non-sm. cell) A549


0.0


0.0


Lung ca. (non-s.cell) NCI-H23


1.0 


0.0 


Lung ca. (non-s.cell) HOP-62


0.4 


0.0 


Lung ca. (non-s.cl) NCI-H522


0.0 


0.0 


Lung ca. (squam.) SW 900 


0.0 


0.0 


Lung ca. (squam.) NCI-H596 


0.5 


1.0 


Mammary gland 


10.4 


15.4 


Breast ca.* (pl. effusion) MCF-7


0.0 


0.4 


Breast ca.* (pl.ef) MDA-MB-231


0.0 


0.0 


Breast ca.* (pl. effusion) T47D


1.0 


0.5 


Breast ca. BT-549 


0.5


1.0 


Breast ca. MDA-N 


0.0 


0.0 


Ovary 


0.0 


1.0 


Ovarian ca. OVCAR-3


7.9 


9.9 


Ovarian ca. OVCAR-4 


0.0


0.0


Ovarian ca. OVCAR-5 


0.0 


0.0 


Ovarian ca. OVCAR-8 


10.1 


7.9 


Ovarian ca. IGROV-1 


0.0 


0.5


Ovarian ca.* (ascites) SK-OV-3 


0.0 


0.0 


Uterus 


2.1 


3.9 


Placenta 


12.1 


13.6 


Prostate 


0.6 


0.5 


Prostate ca.* (bone met) PC-3


0.0


0.0


Testis 


1.7 


1.4 


Melanoma Hs688(A).T 


0.0 


0.0 


Melanoma* (met) Hs688(B).T 


0.0 


0.0 


Melanoma UACC-62 


0.0 


0.0


Melanoma M14 


0.0 


0.0 


Melanoma LOX IMVI 


0.2 


0.5 


Melanoma* (met) SK-MEL-5 


0.0 


0.0 


Adipose


0.0


1.1


Table 29. Panel 2D



Tissue Name


Relative Expression(%)
2Dtm3014t_ag1905


Tissue Name


Relative Expression(%)
2Dtm3014t_ag1905


Normal Colon GENPAK 061003 


21.6 


Kidney NAT Clontech 8120608 


0.6 


83219 CC Well to Mod Diff
(OD03866)


33.9 


Kidney Cancer Clontech 8120613


44.1 


83220 CC NAT (OD03866)


7.5 


Kidney NAT Clontech 8120614


2.3 


83221 CC Gr.2 rectosigmoid
(OD03868)


6.6 


Kidney Cancer Clontech 9010320 


0.5


83222 CC NAT (OD03868)


0.3 


Kidney NAT Clontech 9010321 


2.8 


83235 CC Mod Diff (OD03920)


37.1 


Normal Uterus GENPAK 061018 


2.2


83236 CC NAT (OD03920)


2.9 


Uterus Cancer GENPAK 064011


8.1


83237 CC Gr.2 ascend colon
(OD03921)


100.0 


Normal Thyroid Clontech A+ 
6570-1 


2.3


83238 CC NAT (OD03921)


11.8 


Thyroid Cancer GENPAK 064010 


0.9 


83241 CC firom Partial 
Heoatectomv CODO4309^ 


22.2 


1 03^010 oancer jjn v 11 ivC' vjiirN 
A302152 ■ 


' ^1.0 • 


83242 Liver NAT CODO4309^ 


0.0 


Thyroid NAT INVITROGEN 
A302153 


2.3 


87472 Colon mets to limp 
rOD04451-0n 


12.9 


Normal Breast GENPAK 061019 


- 4.5 


87473 Lune NAT rOD044S 1 .n:>^ 


2.3 


84877 Breast Cancer rOD04566^ 


0.3 


Normal Prostate Clontech A4- 
6546-1 


3.9 


oDy/D Jbsreast Cancer (OD04590- 

on 


" 0.0 


84140 Prostate Cancer rOD04410^ 


1.0 


oDy fo isreasi cancer jviets 
fOD04590-03^ 


0.6 


84141 Prostate NAT ^000441 0^ 


2.5 


0 / u / u jDreasi chancer jN^etastasis 
rOD04655.05^ 


0.8- 


87073 Prostate Cancer rOD04720- 

on 


4.2 


GENPAK Breast Cancer 064006 


6.9 


87074 Prostate NAT rOD04720- 
021 


4.0 


Breast Cancer Res. Gea 1024 


* 14.1 - ^ 1 


Normal Lung GENPAK 061010 


16,6 


Breast Cancer Clontech 9 1 00266 


1.0 . 


83239 Lune Met to Muscle 
rOD04286^ 


0.0 


Breast NAT Clontech 9100265 


0-4 ' 1 


83240 Muscle NAT rOD04286'k 


0.0 


Breast Cancer INVITROGEN 
A209073 


6.7 ' 


841 36 Lune Malignant Cancer 
(OD031261 


8.8 


Breast NAT INVITROGEN 
A2090734 


11.3 


84137 Lune NAT rOD03 1 26^ 


4.7 


Nonual liver GENPAK 061009 


0 0 1 


84871 Lune Cancer rOD04404^ 


3.3 


Liver Cancer GENPAK 064003 


v.v ■ 1 

0.0 


84872 Lune NAT f OD044ft4^ 


3.9 


Liver Cancer Research Genetics 
RNFA1025 . : 


0.5 ^ 


84875 Lune Cancer rOD04565^ 


0.0 


Liver Cancer Research Genetics 
KNA1026 


0.0 


84876 Lune NAT rOD04565^ 


0.6 


x^airea juiver cancer 1 issue 
Research Genetics RNA 6004-T 


0.0 


85950 Lune Cancer rOD04237-01 ^ 


10.7 


Paired Liver Tissue Research 
Genetics RNA 6004-N 


0.6 : 


85970 LunP NAT rOD042:^7-09^ 


3.2 


Paired Liver Cancer Tissue 
Research Genetics RNA 6005-T 


0 6 


83255 Ocular Mel Met to Liver 
rODO4310^ 


0.0 V 


Paired Liver Tissue Research 
Genetics RNA 6005-N 




83256 Liver NAT rOD043 1 0^ 


0.5 ] 


hJormal Bladder GENPAK 061001 


0.0 ' 1 


84139 Melanoma Mets to Lung 
rOD0432n 


] 

0,0 1 


Bladder Cancer Research Genetics 
RNA 1023 


0.0 


84138 Lune NAT rOD0432 U 


1 

J 

2.9 1 


Bladder Cancer INVITROGEN 
\302173 


6.3 


Normal Kidney GENPAK 06 1 008 


66.4 C 


57071 Bladder Cancer ronn471 

m. 


2.1 1 


83786 Kidnev Ca. Nuclear grade 2 
fOD04338^ 


5.8 f 


;7072 Bladder Normal Arfinr**nf 
OD04718-03^ . . 


2.3 


83787 Kidnev NAT fOD0433S^ 


49.3 1^ 


formal Ovary Res. Gen. 


- • 0.0 . 


83788 Kidnev Ca Niiclear or-nrl** 
l/2rOD04339^ 


0.0 C 


)varian Cancer GENPAK 064008 


16.4 


83789 kidnev NAT rOD04339^ 


... 8 
28.1 0 


7492 Ovarv Cancer rOT>04l7i;R- 

71 


0.5 


83790 Kidnev Ca. Clear cell typfi 
rOD04340^ 


— - 1.5 • 8 


7493 Ovarv NAT rOD047fiR.nR^ 


-o.b • V 


83791 Kidnev NAT rOD04340^ . . 


^ 

- 54.7 ^ 0 


lormal Stomach GENPAK 
61017 — - 


0.5


83792 Kidney Ca. Nuclear grade 3
(OD04348)


'on* 


uasmc cancer L^iontecii 9Uo035o 


1.7 


83793 Kidnev NAT rOD04348^ 


12.5 


NAT Stomach Clontech 9060359 . 


1.4. 


87474 Kidnev Cancer rOD04622. 

on 


u.u 


urastnc vjancer Clontecn 9060395 


0.5 


%1A15 Kidnev NAT rOD04622.0^^ 


1.4 


NAT Stomach Clontech 9060394 


0.0 


85973 Kidnev Cancer rOD04450- 


0.0 


Gastric Cancer Clontech 9060397 


0.7 


85974 Kidnev NAT rOD04450.03^ 


71.2 


NAT Stomach Clontech 9060396


0.0 


Kidney Cancer Clontech 8120607 


0.0 


Gastric Cancer GENPAK 064005 


1.0 



Table 30. Panel 4D 



Tissue Name


Relative
Expression(%)
4Dtm3015t_ag1905


Tissue Name


Relative
Expression(%)
4Dtm3015t_ag1905


93768_Secondary Th1_anti-
CD28/anti-CD3


0.0


93100_HUVEC (Endothelial)_IL-
1b


0.0 


93769_Secondaiy Th2_anti- 
CD28/anti-CD3 


0.0 


93779_HUVEC (EndotheliaI)_IFN 
gamma 


0.0 


93770_Secondary Trl anti- 
CD28/anti-.CD3 


0.0 


93102_HUVEC 
(Endo5ielial)jnsrF alpha + IFN 
gamma 


0.0 


93573_Secondary Thl^resting day 
4-6 in IL-2 


0.0 


93101_HUVEC 
(EndolheHal) TNF alpha + IL4 


0.0 


93572_Secondaiy Th2jresting day 
4-0 m IL-2 


0.0 


93781_HUVEC(Endothehal) IL- 
11 


0.0 


93571_Secondaiy Trl resting day 
4-6inIL.2 


0.0 


935S3_Lung Microvascular 
Endothelial Cells none 


0.0 


93568 ^primary Thl_anti- 
o/anti-CD3 


0.0 


93584_Lxmg Microvascular 
EndotheHal Cells TNFa(4ng/ml) 
and ILlb f 1 ng/ml) 


0.0 


93569jprimary Th2_anti- 
L^i^^o/anti-C./JL;3 


0.0 


92662_Microvasciilar Dermal 
endothelium none 


1.3 


93570_primary Trl anti- 
Cb28/anti.CD3 


0.0 


92663_Microsvasular Dermal 
endothehum TNFa (4 ng/ml) and 
ILlb f 1 ne/ml) 


0.0 , : 


93565jpriniary Thl resting dy 4-6 
in IL-2 


0.0 


93773_Bronchial 

epithehum TNFa (4 ng/ml) and 

ILlb (1 ne/ml) ** 


0.0 


93566_jjrimary Th2 resting dy 4-6 
in IL-2 


0.0 


93347_Small Airway 
Epitiielium none 


0.0 


93567_priniary Trl resting dy 4-6 
in IL-2 


■ 

0.0 


93348_Small Airway 
Bpithelium_TNFa (4 ng/ml) and 
CLlb (1 uR/ml) 


0.0 


9335rCD45RA CD4 
lymphocyte anti-CD28/anti-CD3 


. b.o 


?266S_Coronery Artery 
SMC restinfir 


0.0 


93352_CD45ROCD4 
lymphocyte anti.CD28/anti-CD3 


i 

1 

0.0 1 


?2669_Coronery Artery 
3MC_TNFa (4 ng/ml) and DLlb (1 
ig/ml) 


0.0 


93251_CD8 Lymphocytes anti- 
CD28/anti-CD3 


- 0.0 s 


)3107 astrocytes resting 


0.0 


93353_chronic CDS Lymphocytes 
2ry resting dy 4-6 in IL-2 


c 

0.0 . 2 


)310S_astrocytes TNFa (4 ng/ml) 
ind BLlb (1 n^ml) 


0.0 


93574_chronic CDS Lymphocytes 
2ry activated CD3/CD28 


0.0 £ 


>2666 KU-812 (Basophil) resting 


0.7


93354_CD4_none


0.0 


92667_KU-812 
(Basophil) PMA/ionoycin 


1 1.4 


93252_Secondary 

X jijL/ jLuz/ in onu'-OJjyD L^rll J 


0.0 


93579_CCD1106 
(Keratinocytes) none 


1.0 


93103 LAK cells resting 


0 0 


935 80_CCD 1 1 06 
(Keratinocytes) TNFaandlFNg 

♦lie 


6.0 


93788 LAK ceUs IL-2 


0 0 


JLiver uirrnosis 


1 ' 5.6 


93787 LAK cells IL-2+IL-12 


0.0 


93792 Lupus Kidney 


1 9.0 


/ oy^LtJ\r%, ceus^lL-^+lJcvN 
gamma 


0 0 


y^D / / J>iUl-xl292 


0.0 


93790 LAK cells IL-2+IL-18 


0.0 


93358 Na.H292 IL-4 


1 0.0 


cells_PMA/ionomycm and IL-18 


0.0 


93360 NCT-H292 IL.9 


0.0 


93578 NK Cells resting 


0.0 


93359 Na-H292 IL-13 


1 0.0 


93 1 09_Mixed Lymphocyte 
Reaction Two Way MLR 


0.0 


93357 NCI-H292 IFNpaTnmfl 


0.0 . 


93 1 10_Mixed Lymphocyte 
Reaction Two Way MLR 


0.0 




0.0 


93 1 1 l_Mixed Lymphocjrte 
* Reaction Two Way MLR ' 


0.0 


93778_HPAEC_IL-1 beta/TNA 
alpha 


n n 


93 1 12_Mononuclear Cells 
CPBMCs) restdne ' 


1.4 


93254JNformal Hmnan Lung 
Fibroblast none 


J u.u 
1 fi h" 

J_ ".U 


9311 3_Mononuclear Cells 
CPBMCs) PWM 


0.0 


93253_Nonnal Human Lung 
Fibroblast TNFa (4 ng/ml) and IL- 
lb(l ng/ml) 


0.0 


93 1 14_Mononuclear Cells 
(PBMCs) PHA-L . 


0.0 


y3257^Normal Human Lung 
Fibroblast IL-4 


■ ■ 1.2' 


93249 Ramos (B cell) none 


0.0 


y3256^Nonnal Human Lung- 
Fibroblast IL-9 


0.0 


93250 Ramos (B cell) ionomvcin 


0.0 


y3255^Normal Human Lung 
Fibroblast IL-13 | 


0.0 


93349 B lymphocytes PWM 


0.0 


93258_Normal Hiunan Lung 
Fibroblast IFN gamma 


A A 

0.0 . 


93350_B lymphoytes CD40L and 
IL4 


0.0 


93 1 06_Dermal Fibroblasts 
CCD1070 resting 


0.0 


92665_EOL.l 

(Eosinophil)^dbcAMP 

differentiated 


< 

0.0 ( 


?3361_Dennal Fibroblasts 
I^CD1070 TNF alnha 4 Wml 


A A 

0,0 


. 93248_EOD.l:: C . "' 

(Eosinophil)_dbcAMP/PMAionom 
ycin 


c 

0.0 ( 


>3 1 05_Dermal Fibroblasts 
:CD1070 IL-1 beta 1 ng/ml 


0.0 


93356 Dendritic Cells none 


£ 

Hii! g 


6772_dennal fibroblast^IFN 
'amma 


0.0 


93355_Dendritic Cells_LPS 100 
ng/ml 

93775 Dendritic Cells atiti-CD40 
93774 Monocytes resting 


0.0 9 

0.6 9 


3771 deimal fibroblast, IL-4 

3260 IBD Colitis 2 

3261 IBDCrohns 


0.0 
0.0 
0.0 


93776 Monocytes LPS 50 ng/ml 
93581 Macrophages resting 

93582_Macrophages_LPS 100 
ng/ml ^ 


0.0 7 
0.0 7. 


35010 Colon normal 
35019 Lung none 

^U26-l Thymus none 


9.6 
5.6 

100.0 


93098_HUVEC 
(Endothelial) none - 
93099_HUVEC ^" 


0.0 6^ 


^030-1 Kidney none 


0.6 


(Endothelial) starved 


0.6 


Panel 1.3D Summary Ag1905 Two experiments with the same probe and primer set
produce results that are in good agreement, with highest expression in the lung cancer cell line
SHP-77 (CTs=30) and the trachea (CTs=30-31). There is also significant expression of the
NOV4 gene in cell lines derived from the colon and ovary. This gene may play a role in
different types of lung, ovary and colon cancer, as it is more highly expressed in cell lines
derived from these cancers than in the normal tissues. Furthermore, expression in normal
brain and pancreas seems to be higher than in cancer cell lines derived from these tissues. Thus,
expression of the NOV4 gene could be used as a marker or as a therapeutic for colon, ovarian,
brain, lung, and pancreatic cancer. In addition, therapeutic modulation of the product of this
gene, through the use of peptides, chimeric molecules or small molecule drugs, may be useful
in the therapy of these cancers.
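
The summaries quote CT values, while the tables report relative expression as a percentage of the highest-expressing sample in a run. A minimal sketch of one common way to relate the two is given below. It assumes ideal doubling per PCR cycle, so that relative expression is 100 x 2^(CTmin - CT), and it treats CT values above 35 as undetectable, in line with the ">35" cutoff quoted in the summaries. The function name, the cutoff parameter and the sample CT values are illustrative assumptions, not the exact normalization used for these panels.

```python
def relative_expression(ct_values, detection_limit=35.0):
    """Convert CT values to percent expression relative to the best sample.

    Assumes perfect amplification efficiency (a factor of 2 per cycle).
    Samples with CT above detection_limit are reported as 0.0.
    """
    detectable = [ct for ct in ct_values.values() if ct <= detection_limit]
    if not detectable:
        # Nothing above the detection threshold: every sample is reported as zero.
        return {name: 0.0 for name in ct_values}
    ct_min = min(detectable)
    return {
        name: 100.0 * 2.0 ** (ct_min - ct) if ct <= detection_limit else 0.0
        for name, ct in ct_values.items()
    }


if __name__ == "__main__":
    # Hypothetical CT values, chosen only to illustrate the arithmetic.
    example = {"sample A": 30.0, "sample B": 31.0, "sample C": 36.0}
    for name, pct in relative_expression(example).items():
        print(f"{name}: {pct:.1f}%")
```

Under these assumptions a one-cycle difference in CT corresponds to a two-fold difference in relative expression.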

There is also significant expression of the NOV4 gene in tissues involved in the central 
nervous system including the amygdala, hippocampus, thalamus, cerebral cortex, and spinal 
cord. 

Purinoceptors found in GDNF-sensitive sensory neurons mediate nociceptor function.
Since the NOV4 gene product is a homolog of a purinoceptor, agents that block the action of
this receptor may have utility in treating pain, either acting as analgesics or inhibiting the
establishment of chronic pain. In addition, since adenosine plays a significant
neuromodulatory role in brain regions such as the hippocampus, cortex, basal ganglia, and
thalamus, the NOV4 purinoceptor homolog is localized in a position to participate with the
action of adenosine in these brain regions. The protein encoded by the NOV4 gene is most
homologous to P2Y4 and P2Y6 purinoceptors, suggesting that its function may be similar to
the PLC-mediated Ca2+ mobilization induced by these receptors. Ca2+ mobilization is an
important component of the molecular process leading to neurotransmitter release. Adenosine
modulates the release in the brain of glutamate, the main excitatory amino acid
neurotransmitter. Glutamate exerts excitotoxic neuronal damage and death in a number of
pathological conditions, including stroke. Agonists of A1 adenosine receptors attenuate this
damage via G protein-coupled inhibition of glutamate release. Antagonists of A2 receptors
also attenuate glutamate-induced excitotoxicity. Therefore, agents that inhibit or stimulate the
protein encoded by the NOV4 gene are likely to affect glutamate release in the brain and the
subsequent action of glutamate in these regions. If the NOV4 gene product functions similarly
to the A1 receptor with respect to glutamate release, then agonists of the putative receptor are
likely to have utility in the treatment of stroke. If the NOV4 gene product functions similarly
to the A2 receptor, then antagonists of the putative receptor are likely to have utility in the
treatment of stroke. Furthermore, antagonists of the A2a purinoceptor are antidepressants.
Therefore, antagonists of the NOV4 gene product may be useful antidepressants. A2a receptor
antagonists also counter parkinsonian-like symptoms in mice, suggesting that the NOV4 gene
product antagonists may also have utility in the treatment of Parkinson's disease.

Ag2504 Expression of the NOV4 gene is low/undetectable (CT values >35) in all
samples in Panel 1.3D (data not shown).

Panel 2.2 Summary Ag2504 Expression of the NOV4 gene is low/undetectable (CT
values >35) in all samples in Panel 2.2 (data not shown).

Panel 2D Summary Ag1905 Highest expression of the NOV4 gene is detected in a
colon cancer (CT=30.4). Furthermore, this gene appears to be overexpressed in
colon cancer when compared to normal adjacent tissue in all six matched tissue pairs present
in this panel. Thus, expression of the AC025756_dal gene could be used to differentiate
between colon cancer and normal tissue. Furthermore, therapeutic modulation of the function
or activity of the NOV4 gene product could be effective in the treatment of colon cancer. The
NOV4 gene also shows a reverse association in the kidney, with overexpression of the gene
present in normal kidney when compared to the corresponding cancerous tissue. Thus,
expression of the gene could also be used to differentiate between normal and cancerous
kidney tissue, and therapeutic modulation of the gene product could be effective in the
treatment of renal cancer.
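
The matched-pair comparison described in the Panel 2D summary can be sketched as follows. The values are the relative expression percentages listed in Table 29 for the six colon cancer samples and the normal adjacent tissue (NAT) samples tabulated with them. Pairing each tumor with the NAT sample that follows it in the table, and the helper name, are assumptions made for illustration.

```python
# (tumor sample, tumor %, NAT sample, NAT %) transcribed from Table 29 (Ag1905, Panel 2D).
# The pairing of tumor and NAT samples is assumed from their adjacency in the table.
COLON_PAIRS = [
    ("83219 CC Well to Mod Diff", 33.9, "83220 CC NAT", 7.5),
    ("83221 CC Gr.2 rectosigmoid", 6.6, "83222 CC NAT", 0.3),
    ("83235 CC Mod Diff", 37.1, "83236 CC NAT", 2.9),
    ("83237 CC Gr.2 ascend colon", 100.0, "83238 CC NAT", 11.8),
    ("83241 CC from Partial Hepatectomy", 22.2, "83242 Liver NAT", 0.0),
    ("87472 Colon mets to lung", 12.9, "87473 Lung NAT", 2.3),
]


def overexpressed_in_tumor(pairs):
    """Return the tumor samples whose expression exceeds that of the paired NAT sample."""
    return [tumor for tumor, tumor_pct, _nat, nat_pct in pairs if tumor_pct > nat_pct]


if __name__ == "__main__":
    hits = overexpressed_in_tumor(COLON_PAIRS)
    print(f"{len(hits)} of {len(COLON_PAIRS)} pairs show higher expression in tumor")
```

A simple greater-than comparison is used rather than a fold change, since one NAT value is 0.0 and a ratio would be undefined for that pair.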

Panel 4D Summary Ag1905 Expression of the NOV4 gene is limited to the thymus.
The putative GPCR encoded by this gene could be important in T cell
development, since purinoreceptors have been demonstrated in thymocytes.
Immunomodulatory therapeutic drugs designed with the protein encoded for by the NOV4
gene may regulate T cell production in the thymus and be important in preventing tissue
rejection, treating autoimmune disorders and treating viral diseases such as AIDS. In addition,
the transcript or antibodies designed against the protein encoded for by the transcript could be
used as diagnostic markers for identifying subsets of thymocytes at specific developmental
stages.

Ag2504 Expression of the NOV4 gene is low/undetectable (CT values >34.5) in all
samples in Panel 4D (data not shown).

Panel CNS_neurodegeneration_v1.0 Summary Ag2504 Expression of the NOV4
gene is low/undetectable (CT values >35) in all samples in Panel CNS_neurodegeneration_v1.0
(data not shown). (Nagy et al., Apoptosis of murine thymocytes induced by extracellular ATP
is dose- and cytosolic pH-dependent. Immunol Lett. 72:23-30, 2000; Liu et al., P2Y
purinoceptor activation mobilizes intracellular Ca2+ and induces a membrane current in rat
intracardiac neurones. J Physiol. 526 Pt 2:287-98, 2000; Ongini et al., Selective adenosine
A2A receptor antagonists. Farmaco. 56(1-2):87-90, 2001; Chen et al., Neuroprotection by
caffeine and A(2A) adenosine receptor inactivation in a model of Parkinson's disease. J
Neurosci. 21:RC143, 2001; Wardas et al., SCH 58261, an A(2A) adenosine receptor
antagonist, counteracts parkinsonian-like muscle rigidity in rats. Synapse. 41:160-71, 2001;
Driessen et al., Depression of C fiber-evoked activity by intrathecally administered reactive
red 2 in rat thalamic neurons. Brain Res. 796(1-2):284-90, 1998; El Yacoubi et al., Adenosine
A2A receptor antagonists are potential antidepressants: evidence based on pharmacology and
A2A receptor knockout mice. Br J Pharmacol. 134:68-77, 2001).

NOV5: CG8841-like

Expression of NOV5 gene (also referred to as AC026756_dal) was assessed using the
primer-probe set Ag2000 described in Table 31. Results from RTQ-PCR runs are shown in
Tables 32, 33, and 34.



Table 31. Probe Name Ag2000

Primers   Sequences                                        TM     Length   Start Position   SEQ ID NO:
Forward   5'-ACTCCACCAAGAAGATCCAGTT-3'                     59.1   22       1007             120
Probe     FAM-5'-TCTCTTCTGGAAGCTCTGCGACTTCA-3'-TAMRA       68.8   26       1047             121
Reverse   5'-GCACGAAGAAGAGGAATTTCTT-3'                     59     22       1075             122



Table 32. Panel 1.3D



Tissue Name


Relative Expression(%)
1.3Dtm2809f_ag2000


Tissue Name


Relative Expression(%)
1.3Dtm2809f_ag2000


Liver adenocarcinoma 


9.8 


Kidney (fetal) 


6.0 


Pancreas 


24.8 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 2


1.3 


Renal ca. A498 


1.0 


Adrenal gland 


3.3 


Renal ca. RXF 393


0.0 


Thyroid 


11.0 


Renal ca. ACHN 


1.5 


Salivary gland 


30.6 


Renal ca.UO-31 


1.1 


Pituitary gland 


30.4 


Renal ca. TK-10


2.4 


Brain (fetal) 


13.0 


Liver


0.7 


Brain (whole) 


39.2 


Liver (fetal) 


2.5


Brain (amygdala) 


23.7 


Liver ca. (hepatoblast) HepG2


8.8


Brain (cerebellum) 


21.0 


Lung 


12.9 



Brain (hippocampus)


46.7 


Lung (fetal) 


30 4 




Brain (substantia nigra)


10.4 


Lung ca. (small cell) LX-1


™ '8.7 1 




Brain (thalamus) 


33.2 


Lung ca. (small cell) NCI-H69 






Cerebral Cortex 


100.0 


Lung ca. (s.cell var.) SHP-77 


33 0 1 




Spinal cord 


14.6 


Lung ca. (large cell) NCI-H460


0 0 




CNS ca. (glio/astro) U87-MG


0.1 


Lung ca. (non-sm. cell) A549 


l/.y • 1 

15 0 




CNS ca. (glio/astro) U-118-MG


0.3 


Lung ca. (non-s.cell) NCI-H23 


9 1 




CNS ca. (astro) SW1783


0.0 


Lung ca (non-s.cell) HOP-62 






CNS ca * (neuro; met ) SK-N-AS 


4.3 . 


Lung ca. (non-s.cl) NCI-H522 


J.J - 1 
1 R 1 




CNS ca. (astro) SF-539 


0.0 


Lung ca. (squam.) SW 900 


i.O 1 

90 9 




CNS ca. (astro) SNB-75


35.6 


Lung ca. (squam.) NCI-.H596 






CNS ca. (glio) SNB-19


5.7


Mammary gland 


ACS 1 




CNS ca. (glio) U251


2.1 


Breast ca.* (pi. effiision'i MCF-7 


*tU. 1 1 

AO (\ *l 




CNS ca. (glio) SF-295


2.6 


Breast ca.* (pLeft MDA-MB-23 1 


nZ.U • 1 




Heart (fetal) 


44.4 


Breast ca.* (pi. efiusion) T47D 


7"^ 9 - 1 




Heart 


3.6 


Breast ca. BT-549 






Fetal Skeletal 


69.3 


Breast ca. MDA-N 


U.lr I 
ft 9 




Skeletal muscle 


0.6 


Ovary 


I*? < I 

i /.o , 1 




Bone marrow 


1.8 


Ovarian ca. OVCAR-3 


on < 1 




Thymus 


2.9 


Ovarian ca. OVCAR-4 


a o 1 




Spleen 


14.8 


Ovarian ca. OVCAR-5 


1 5 A ' 1 




Lymph node 


8.6 


Ovarian ca. OVCAR-8 






Colorectal 


18.9 


Ovarian ca. IGROV-1 


to- 1 

i.y / 




Stomach 


68.3 


Ovarian ca.* ('ascites'^ SK.nv-^ 


O 1 1 




Small intestine 


21.9 


Jterus 


- - o o . 1 




::oIon ca. SW480 


10.0 


Placenta 


97 9 


< 


::ol6n ca. * ( S W480 met)S W620 


2.9 ] 


^restate 


25.9 


( 


2o\on ca. HT29 


16.8 I 


^rostate ca.* (bone met)PC-3 


1 R 7 


( 


:olQnca.HCT"116 - 


5.5 1 


restis • 


1 o. / / 

7.4 1 


( 

s 




11.6 I 


idelanoma Hs688(A).T 


0.0 


c 


3219 CC Well to Mod Diff 
ODQ3866^ 

'olon ca. HCC-2998 


__£7,o__^ 


Melanoma* (met) Hs688(B'\-T 


A 1 1 

u.i 1 


r 


rastric ca.* (liver met) NCI-N87 


17.2 \ 
48.6 \ 


lelanoma UACC-»62 

lelanoma M14 


0.0 ; 
0.0 1 


E 


tladder 


10.7 N 


ielanoma LOX IMVI 


0.0 1 


•1 


rachea 


36.1 K 


lelanoma* (met) SK-MEL-5 


0.7 - 1 


K 


idney 


1.9 A 


idipose 


/4,3. - 1 



Table 33. Panel 2.2



Tissue Name


Relative Expression(%)
2.2x4tm6394f_ag2000_a1


Tissue Name


Relative Expression(%)
2.2x4tm6394f_ag2000_a1


Normal Colon GENPAK 061003
97759 Colon cancer (OD06064)


13.9
21.3


83793 Kidney NAT (OD04348)
98938 Kidney malignant cancer 
(OD06204B) 


lOJ 
29.6


97760 Colon cancer NAT
(OD06064)


24A 


98939 Kidney normal adjacent 
tissue (OD66204E) 


'3.8 


97778 Colon cancer (OD06159) 


7.0 


85973 Kidnev Cancer fOD04450- 
01) 


4.1 


97779 Colon cancer NAT 
(OD06159) 


11.0 


85974 Kidnev NAT ^0004450.03^ 


I 5.0 


98861 Colon cancer (OD06297-04^ 


8.7 


Kidney Cancer Clontech 8120613 


1.3 


98862 Colon cancer NAT 
(OD06297-015) 


14.1 


Kidney NAT Clontech 8120614 


7.6 


83237 CC Gr.2 ascend colon 
fOD0392n 




iviuney \^ancer L/ioniecn yuiuozu 




83238 CC NAT rOD0392n 


4.8 


Kidney NAT Clontech 9010321 


2.9 


97766 Colon cancer nietastasis 




Jsaoney Cancer Clontech 8120607 


8.9 


97767 Lung NAT (OD06104) 


10.2 


Kidney NAT Clontech 8120608 


3.0 


87472 Colon mets to lung 


1 A O 


Normal Uterus GENPAK 061018 


9.0 • 


87473 LuneNAT rOD04451.02^ 


8.3 


Uterus Cancer GENPAK 06401 1 


4.9 


Normal Prostate Clontech A+ 
6546-1 (8090438) 


43.0 


Normal Thyroid Qontech A+ 
6570-1 (7080817) 


5.4 


84140 Prostate Cancer rOD04410^ 


17.2 


Thyroid Cancer GENPAK 064010 


2.8 


84141 Prostate NAT fOD04410^ 


10.4 


Thyroid Cancer INVITROGEN - 

A302152 


6.3 


Normal Ovary Res. Gen. 


7.6 


Thyroid NAT INVITROGEN 
A302153 


4.6 


98863 Ovarian cancer (OD06283- 
03) . 


9.5 


Nonnal Breast GENPAK 061019 


19.6: 


98865 Ovarian cancer 
NAT/fallopian tube (OD06283-07) 


4.7 


84877 Breast Cancer (OD04566^ 


15.8 


Ovarian Cancer GENPAK 064008 


7.3 


Breast Cancer Res. Gen. 1024 


22.3 


97773 Ovarian cancer (OD06145) 


0.4 


85975 Breast Cancer f OD04590- 
OD 


47.6 


97775 Ovarian cancer NAT 
(OD06145) 


7.3 


85976 Breast Cancer Mets 
rOD04590-03^ 


41.3 


98853 Ovarian cancer (OD06455- 
03) . 


18.0 


87070 Breast Cancer Metastasis . 
rOD04655.O5^ 


100.0 


98854 Ovarian NAT (Op06455- 
07) Fallopian tube 


2.4 


GENPAK Breast Cancer 064006 


11.1 ' 


Normal Lung GENPAK 061010 


18.6 


Breast Cancer Clontech 9100266 


49.1 ■ 


92337 Invasive poor difif. lung 
adeno (bDO4945-01 


10.0 


Breast NAT Oontech 9100265 


20.7 


92338 Lmg NAT f ODO4945-03) 


5.7 


Breast Cancer INVITROGEN 
A209073 


18.6 


84136 Lung Ma1ip;ii9nt Cancer 
fOD03126^ 


17 


Breast NAT INVITROGEN 


, -•,21.5 


84137 Lung NAT rODO:^ 1 26^ 


3.9 


57763 Breast cancer COD06083) 


81.2 


90372 Lunfi Cancer (OD05014A) 


11.4 ! 


?7764 Breast cancer node 
metastasis fOD06083) 


65.8 


90373 Lung NAT fOb05014B) 


0.2 ] 


formal Liver GENPAK 061009 


2.4 


97761 Lung cancer COD06081) 


] 

4.2 ] 


wiver Cancer Research Genetics 

»* A Wj I %.r V*ll Villi VO 

WA 1026 


4.4 


97762 Lunfi cancer NAT 
COD06081) 


J 

6.2 I 


^iver Cancer Research Genetics 
WA1025 


4.6 


85950 Lunp Cancer rOD04237-0n 


I 

4.6 - I 


^aired Liver Cancer Tissue 
lesearch Genetics RNA 6004-T 


3.8 - 


85970 Liiiia NAT (Cmt\iiO.%l ^&>\ 


9.1 I 


Paired Liver Tissue Research
Genetics RNA 6004-N


1.5




odzod fjcuiar Mel Met to Liver 
fODO4310^ 


0 7 


Paired Liver Cancer Tissue 
rvesearcn oenetics JvNA oU05-T 


'^12.1 


83256 Liver NAT rOD043 1 0^ 


2.8 


Paired Liver Tissue Research 
Genetics KNA 6005-N 




84139 Melanoma Mets to Lunfr 
rOD0432n 




L.iver cancer GENPAK 064003 


1,5 


84138 Limp NAT ron04^'?n 


9.2 


Normal Bladder GENPAK 061001 


19.6 


Nonnal Kidney GENPAK 061008 


2.4 


Bladder Cancer Research Genetics 
RNA 1023 


6.3 


83786 Kidnev Ca. Nuclear prade >> 
rOD04338^ 


9.7 


Bladder Cancer INVITROGEN 
A302173 


8.6 


83787 Kidnev NAT rOD043^R) 


1 7 


Normal Stomach GENPAK 


- 62.5 


83788 Kidnev Ca Nuclear erade 
/2 fOD04339^ 


4.2 


Gastric Cancer Clontech 9060397 


.5,1 


83789 Kidnev NAT rOD043':^0^ 


4.1 


^JAT Stomach Clontprh on/=*n^o^^ 




3790 Kidnev Ca. Clear cell typ^ 
fOD04340^ 


2.7 


Gastric Cancer Clontech 9060395 


* 21.5 


83791 Kidnev NAT COD0434n^ 


6.7 ] 


hJAT Stomach Clontech 9060394 


43,5^ 


83792 Kidnev Ca. Nuclear prade 3 
fOD04348^ 


0.6 ( 


jastric Cancer GENPAK 064005 


11.4


Table 34. Panel 4D



Tissue Name 


Relative 

Expression(%) 




Relative 
Expression(%) 


4dx4tm5534f 
as2000 al 


Tissue Name 


4dx4fm5534f 
ag2000 al 


93768__Secondaiy Thl anti- ~" 
CD28/anti-CD3 


0.2 


93100_HUVECCEndothelial) IL- 
Ib . 


4.8 


93769_Secondary Th2 antU 
CD28/anti-CD3 


0.3 


93779_HUVEC CEndotheUal)_IFN 
gamma 


14.7 


93770_Secondary Trl anti- 
CD28/anti-.CD3 


0.6 


93102_HUVEC 

(Endothelial)_TNF alpha + IFN 
gamma 


1.9 1 


93573_Secondaiy Thl resting day 
4-6inIL-2 ' 


0.1 


93101_HUVEC 
(Endothehal) TNF alpha + IL4 


4.0 


93572_Secondary Th2 resting day 
4.6inIL-2 


0.7 


93781_HUVEC (Endothelial) IL- 
11 , ■ 


15.7 


93571_Secondary Trl resting day 
4-6inIL-2 


0.3 


93583_Lung Microvascular 
Endothelial Cells none 


14.4 


93568_priniary Thl anti- 

CD28/anti"-CD3 


0.1 


93584_Lung Microvascular 
Endothelial Cells TNFa(4ng/ml) 
andlLlb (1 ng/ml) 


6.3 


93569^riniary Th2 anti- ... 
CD28/anti-CD3 


0.2 


92662_Microvascular Dmnal- ' 
endothelium none 


15.5. 


93570 jrimary Trl anti- 
CD28/anti-CD3 


• 

0.1 ] 


92663_Microsvasular Dermal 
endothelium TNFa (4 ng/ml) and 
[Lib (1 nfi/ml) 


5.2 ' 


93565_primaiy Thl resting dy 4-6 
inIL-2 


! 

0.4 1 


W773_Bronchial 

jpithelium TNFa (4 ng/ml) and 

Lib (1 ng/ml) ** " ' 


2.6 


93566_prunary Th2 resting dy 4-6 
inIL-2 . . 


c 

0.2 - I 


>3347^Small Airway 

epithelium none 


0.8 


93567_pnniary Trl resting dy 4-6 
iDlL^2 


... S 
0.1 £ 


>3348_Small Airway, 
spithelium TNFa (4 ng/ml) and 
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ILlb (1 ng/ml) 




93351_CD45RACD4. 
lyinphocyte_anti-CD28/anti-CD3 


0.3 


92668_Coroneiy Artery 
SMC resting 


0,1 


. 93352_,CD45RO CD4 
lymphocyte anti-CD28/anti-CD3 ' 


0.7 


92669_Coronery Artery 
SMC^TNFa (4 ng/ml) and ILlb (1 
ng/ml) 


0.3 


9325 1_CD8 Lymphocytes anti- 
CD28/anti-CD3 


0.8 


93107 astrocytes resting 


3.6 


93353_chromc CD8 Lymphocytes 
2ry_resting dy 4-6 in IL-2 


0.9 


93108_astrocytes_TNFa (4 ng/ml) 
andlLlb (1 ng/ml) 


6.9 


93574_chromc CD8 Lymphocytes 
2ry_activated CD3/CD28 


0.0 


92666 KU-812 (Basophil) resting 


0.0 


93354 CD4 none 


2.7 


92667_KU-812 
(Basophil) PMA/ionoydn 


0.0 


93252_Secondaiy 
Tnl/Th2/Trl aiiti-CD95 CHI 1 


0.0 


93579_CCD1106 
(Keratinocytes) none 


0.3 


93103 LAK cells resting 




yoDoU_CC/Dl 106 
(Keratinocytes) TNFaandlE^g 


1.5 


93788 LAK cells IL-2 




JLiver \Jirrjiosis 


11.3 


93787 LAK cells IL-2+IL-12 


1.4 


93792 Lupus Kidney 


9.2 


93789_LAK cells_IL-2+ll*'J>I 
• gamma * 


1 2 




20.3 


93790 LAKceUs IL-2+IL-18 


1.6 


93358 Na-H292 IL.4 


17.5 


1 t\A T A XT 

y31U4_LAK 

cells_PMA/ionomycin and IL-18 


0.3 


93360 NCI-H292 IL-9 


21.6 


93578 NK Cells IL-2 resting 


0.4 


93359 NCI-H292 IL-13 


9.5 


93 1 09_Mixed Lyn^hocyte 
Reaction Two Way MLR 


1.2 


93357 Na-H292 IFN gamma 


10.3- 


93 1 10_Mixed Lymphocyte 
Reaction Two Way MLR 


0.4 


93777 HPAEC - 


13.7 • 


93 1 1 l_Mixed Lymphocyte 
Reaction Two Way MLR 


0.0 


93778_HPAEC_IL-1 beta/INA 
alpha 


9.2 


93 1 12_Mononuclear Cells 
(PBMCs) resting 


0.8 


93254_Nonnal Human Lung 
Fibroblast none 


0.2 


93113 Mononuclear Cells 
(PBMCs) PWM 


0.2 


93253_Normal Human Lung 
Fibroblast TNFa (4 ng/ml) and IL- 
lb (1 ng/ml) 


0.8 


93114 Mononuclear Cells 
(PBMCs) PHA-L* ' 


ois 


93257_Normal Human Lung 
Fibroblast IL-4 


0.1 • 


93249 Ramos (B cell) none 


0.5 ] 


93256_NoTmal Human Limg 
Fibroblast IL-9 


0.2 


93250 Ramos (B cell) ionomycin 


0.7 ] 


?3255_Normal Human Lung 
Fibroblast IL-13 


0.2 


93349 B lymphocytes PWM 


0.8 ] 


?3258_Nonnal Human Lung 
Fibroblast IFN gamma 


■ 0.3 . 


93350_B lynqDhoytes CD40Land 
IL-4 


5.8 ( 


}3 106_Dermal Fibroblasts 
::CD1070 resting 


0.1 


92665_EOL-l 

(Eosinophil)_dbcAMP 

differentiated 


c 

0.0 ( 


)3361_Dermal Fibroblasts 
:CD1070 TNF alpha 4 ng/ml 


0.6 ■ 


93248_EOI^l 

(Eosinophil) dbcAMP/PMAionom 

ycin • 


c 

0.0 ( 


o luj^jjermai riDroDlasts 
:aD 1070 IL-l beta 1 nR/ml 


0.1 


93356 Dendritic Cells none 


0,2 g 


•3772_dermal fibroblast_IFN 

amma ' 


0.1 


93355 Dendritic Cells LPS 100 


0.0 19 


3771 dermal fibroblast JLA . . 


0.1



ng/ml 








93775 Dendritic Cells anti-CD40 


0.0 


93260 IBDCohtis2 


^^*2.9 


93774 Monocytes resting 


0.0 


93261 IBDCrohns 


9.3 


93776 Monocytes LPS 50 np/ml 


0.0 


735010 Colon normal 


100.0


93581 Macrophages resting 


0.1 


735019 Lung none 


• 19.4 ' 


93582_Macrophages_LPS 100 
ng/ml 


0.0 


64028-1 Thymus none 


11.2 ■ 


93098_HUVEC 
(Endothelial) none 


7.4 


64030-1 Kidney none 


6.4 


93099__HUVEC 
(Endothehal) starved 


17.8 







Panel 1.3D Summary Highest expression of the NOV5 gene, a homolog of a
transmembrane multi-pass protein, is seen in the cerebral cortex (CT=26.8), with moderate
expression detectable across all regions of the brain. Because this gene shows a large down-
regulation in brain cancers, its absence would be an excellent marker to determine whether brain
tissue was pre-cancerous when examining and classifying postmortem tissue.

Expression of the NOV5 gene is also widespread among tissues with metabolic
relevance, including adipose, pancreas, adult and fetal heart, adult and fetal liver, adult and
fetal skeletal muscle, and the adrenal, pituitary, and thyroid glands. The NOV5 gene is
expressed at much higher levels in fetal heart and skeletal muscle (CTs=28) than in adult heart
and skeletal muscle (CTs=31-34). This differential expression pattern suggests that NOV5
gene expression could be used to differentiate between the two tissue sources for heart and
skeletal muscle. Furthermore, the significantly higher level of expression of the gene in fetal
skeletal muscle suggests that the NOV5 gene product may be involved in muscular growth or
development in the fetus and could potentially act in a regenerative capacity in an adult.
Therefore, therapeutic modulation of the NOV5 gene could be useful in the treatment of
muscle related diseases and the treatment of weak or dystrophic muscle.
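
As a rough guide to what this CT gap means, each PCR cycle corresponds to about a two-fold difference in starting template under the idealized assumption of 100% amplification efficiency. The sketch below converts the CT values quoted above into approximate fold differences. The function name and the default efficiency are illustrative assumptions, not part of the described assay.

```python
def fold_difference(ct_high_expr: float, ct_low_expr: float, efficiency: float = 2.0) -> float:
    """Approximate fold difference in template implied by two CT values.

    Assumes ideal exponential amplification: efficiency = 2.0 means the
    product doubles every cycle (an assumption, not a measured value).
    A lower CT means more starting template.
    """
    return efficiency ** (ct_low_expr - ct_high_expr)


# CT values quoted in the text: fetal tissue about 28, adult tissue 31-34.
low = fold_difference(28, 31)   # smallest gap: 3 cycles
high = fold_difference(28, 34)  # largest gap: 6 cycles
print(f"fetal vs adult: roughly {low:.0f}- to {high:.0f}-fold higher")
```

Under these assumptions the 3 to 6 cycle gap corresponds to roughly an 8- to 64-fold difference between fetal and adult tissue.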

The NOV5 gene is also expressed at significant levels in cell lines derived from
ovarian, breast, lung, gastric, prostate and colon cancers compared to the normal tissues. Thus,
the expression of this gene could be of use as a marker or as a therapeutic for ovarian, breast,
lung, gastric, prostate and colon cancers. In addition, therapeutic modulation of the product of this
gene, through the use of peptides, chimeric molecules or small molecule drugs, may be useful
in the treatment of these cancers.

(CT=28) as is seen in Panel 1.3D. In addition, there is significant overexpression of the NOV5
gene in a cluster of breast, lung, and ovarian cancer samples when compared to corresponding
ovarian and lung cancers from normal tissue and as a marker for the presence of these cancers.
Furthermore, therapeutic modulation of the protein product of the NOV5 gene could be
beneficial in the treatment of breast, ovarian and lung cancers. The expression of this gene also
shows a reverse association with some normal stomach samples when compared to the
matched gastric cancer tissue. This suggests that the NOV5 gene could be used to distinguish
between normal and cancerous gastric tissue and that therapeutic modulation of the gene
product may be useful in the treatment of gastric cancer.

Panel 4D Summary The highest expression of the NOV5 gene is found in the colon
(CT=26.2), with modest expression detectable in the muco-epidermoid cell line H292, and the
lung. It is also expressed at moderate levels on HUVEC and lung microvasculature regardless
of their activation status. The protein encoded by the NOV5 gene is homologous to an
epidermal growth factor related protein (fibropellin like) and could be used as a marker of lung
muco-epidermoid cells, colon or vasculature. The putative protein encoded by the transcript
may also play an important role in the normal homeostasis of these tissues. Small molecule or
antibody therapeutics designed with the NOV5 gene product could be important for
maintaining or restoring normal function to these organs during inflammation associated with
asthma and emphysema.

NOV6: Synaptotagmin-like 

Expression of the NOV6 gene (also referred to as SC134912642_dal) was assessed using
the primer-probe set Ag2056 described in Table 35. Results from RTQ-PCR runs are shown in
Tables 36, 37, 38, 39 and 40.



Table 35. Probe Name Ag2056

Primers | Sequences | TM | Length | Start Position | SEQ ID NO:
Forward | 5'-CTGGTCTCTGCCATCATCAC-3' | 59.2 | 20 | 55 | 123
Probe | TET-5'-CTTAGCX3TCACTGTCGTCCTCGCTAG-3'-TAMRA | 68.4 | 26 | 82 | 124
Reverse | 5'-TGTAGCGTTTGCCCAGTTT-3' | 59.3 | 19 | 130 | 125
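
The length column of Table 35 can be cross-checked directly from the primer sequences. The following is a minimal Python sketch. The helper names are hypothetical, the probe sequence is omitted because it is partly illegible in this copy, and reading "Start Position" as a 1-based coordinate on a single strand is an assumption.

```python
FORWARD = "CTGGTCTCTGCCATCATCAC"   # forward primer, Table 35
REVERSE = "TGTAGCGTTTGCCCAGTTT"    # reverse primer, Table 35


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def end_position(start: int, length: int) -> int:
    """Last coordinate covered, assuming 1-based inclusive positions."""
    return start + length - 1


for name, seq in (("Forward", FORWARD), ("Reverse", REVERSE)):
    print(f"{name}: length {len(seq)}, GC {gc_fraction(seq):.1%}")

# Assumed layout: forward primer (start 55), probe (start 82, length 26),
# reverse primer (start 130). The probe should sit between the primers.
forward_end = end_position(55, len(FORWARD))
probe_end = end_position(82, 26)
print("probe lies between primers:", forward_end < 82 and probe_end < 130)
```

The sequence lengths agree with the tabulated values of 20 and 19, and under the assumed coordinate convention the probe falls between the two primers.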



Table 36. Panel 1.3D

Tissue Name | Relative Expression(%) 13Dtm2580t_ag2056 | Tissue Name | Relative Expression(%) 13Dtm2580t_ag2056
Liver adenocarcinoma | 2.4 | Kidney (fetal) | 1.9
Pancreas | 1.8 | Renal ca. 786-0 | 0.2
Pancreatic ca. CAPAN 2 | 2.3 | Renal ca. A498 | 6.0
Adrenal gland | 0.8 | Renal ca. RXF 393 | 0.8
Thyroid | 1.3 | Renal ca. ACHN | 0.0
Salivary gland | 7.9 | Renal ca. UO-31 | 0.0
Pituitary gland | 16.3 | Renal ca. TK-10 | 0.6
Brain (fetal) | 4.8 | Liver | 0.3
Brain (whole) | 26.8 | Liver (fetal) | 1.1
Brain (amygdala) | 24.0 | Liver ca. (hepatoblast) HepG2 | 1.8
Brain (cerebellum) | 8.8 | Lung | 0.6
Brain (hippocampus) | 56.3 | Lung (fetal) | 0.9
Brain (substantia nigra) | 2.9 | Lung ca. (small cell) LX-1 | 7.3
Brain (thalamus) | 23.0 | Lung ca. (small cell) NCI-H69 | 16.2
Cerebral Cortex | 100.0 | Lung ca. (s.cell var.) SHP-77 | 20.6
Spinal cord | 0.6 | Lung ca. (large cell) NCI-H460 | 0.1
CNS ca. (glio/astro) U87-MG | 0.4 | Lung ca. (non-sm. cell) A549 | 0.1
CNS ca. (glio/astro) U-118-MG | 19.1 | Lung ca. (non-s.cell) NCI-H23 | 0.6
CNS ca. (astro) SW1783 | 1.8 | Lung ca. (non-s.cell) HOP-62 | 0.3
CNS ca.* (neuro; met) SK-N-AS | 0.6 | Lung ca. (non-s.cl) NCI-H522 | 2.0
CNS ca. (astro) SF-539 | 0.5 | Lung ca. (squam.) SW 900 | 1.1
CNS ca. (astro) SNB-75 | 17.0 | Lung ca. (squam.) NCI-H596 | 13.3
CNS ca. (glio) SNB-19 | 0.0 | Mammary gland | 12.9
CNS ca. (glio) U251 | 0.1 | Breast ca.* (pl. effusion) MCF-7 | 4.7
CNS ca. (glio) SF-295 | 0.6 | Breast ca.* (pl.ef) MDA-MB-231 | 0.1
Heart (fetal) | 2.8 | Breast ca.* (pl. effusion) T47D | 17.1
Heart | 1.7 | Breast ca. BT-549 | 0.0
Fetal Skeletal | 9.4 | Breast ca. MDA-N | 0.1
Skeletal muscle | 0.1 | Ovary | 0.9
Bone marrow | 0.0 | Ovarian ca. OVCAR-3 | 2.5
Thymus | 0.1 | Ovarian ca. OVCAR-4 | 1.1
Spleen | 1.1 | Ovarian ca. OVCAR-5 | 3.9
Lymph node | 0.1 | Ovarian ca. OVCAR-8 | 2.3
Colorectal | 3.2 | Ovarian ca. IGROV-1 | 0.0
Stomach | 1.9 | Ovarian ca.* (ascites) SK-OV-3 | 3.5
Small intestine | 0.3 | Uterus | 1.3
Colon ca. SW480 | 6.7 | Placenta | 18.0
Colon ca.* (SW480 met) SW620 | 0.3 | Prostate | 18.8
Colon ca. HT29 | 1.5 | Prostate ca.* (bone met) PC-3 | 4.5
Colon ca. HCT-116 | 4.5 | Testis | 2.3
Colon ca. CaCo-2 | 18.8 | Melanoma Hs688(A).T | 0.0
83219 CC Well to Mod Diff (ODO3866) | 14.7 | Melanoma* (met) Hs688(B).T | 1.6
Colon ca. HCC-2998 | 10.6 | Melanoma UACC-62 | 0.1
Gastric ca.* (liver met) NCI-N87 | 10.5 | Melanoma M14 | 0.0
Bladder | 0.9 | Melanoma LOX IMVI | 0.6
Trachea | 3.5 | Melanoma* (met) SK-MEL-5 | 1.5
Kidney | 0.5 | Adipose |

Table 37. Panel 2.2



Tissue Name 


Relative 
Expression(%) 




Relative 

Expre5sion(% 1 


2Jx4tm6379 
t ag2056. al 


Tissue Name 


2.2x4tm6379 
t_ae2056 al 


Nonnal Colon GENPAK 061003 


5.3 


83793 Kidnev NAT rOD04348'> 


10.3 


97759 Colon cancer {OD06064) 


0.0 


98938 Kidney malignant cancer 
(OD06204B) 


1.3 


97760 Colon cancer NAT 
(OD06064) 


0.7 


98939 Kidney nonnal adjacent 
tissue (OD06204E) 


0.8 


97778 Colon cancer (OD06159) 


1.1 


85973 Kidnev Cancer COD04450- 
011 


0.0 


97779 Colon cancer NAT 


' 1 1 
1. / 


ojy /H Ivicinev IMAX (UJJll4450-lJi 


2.7 


98861 Colon cancer (OD06297-041 


2.9 


Kidney Cancer Clontech 8120613 


2.1 


98862 Colon cancer NAT 
fOD06297-015) 


4 1 


Jviancy in a i v^ioniecn o x zuo i *i' 




83237 CC Gt.2 ascend colon 
rOD0392n 

\ \^ X p 


9 % 


jsjoney L^ancer oxontecn yuiujzu 


A O 

0.2 


8 "^9 R PC M AT rODfS'^ 0*? 1 




isadney JN A l Clontecn 90 1 032 1 


0.6 


97766 Colon cancer metastasis 
rOD06104^ 




jsjoney cancer L^iontecn oizuou/ 


A C 

0.5 




u.u 


JsJdney r»JAl Clontecn 8120608 


1.3 


%lAni Colon mets to lunp 
rOD04451-0D 




iNormai uierus vjiiXNJrAJs. uoiulo 


1.0 


87473 Lunp NAT rOD04451-02"i 


0.0 


Uterus Cancer GENPAK 06401 1 


0.6 


Nomial Prostate Clontech A+ 
6546-1 (8090438) 


11,8 


Nonnal Thyroid Clontech A+ 
6570-1 (7080817) 


0.0 . 


84 140 Prostate Cancer COD04410) 


14.1 


Thyroid Cancer GENPAK 064010 


0.0 


84141 Prostate NAT rOD04410^ 


21.1 


Thyroid Cancer INVITROGEN 
A302152 


1.4 


Normal Ovary Res. Gen. 


0.0 


Thyroid NAT INVITROGEN 
A302153 


0.3 - 


98863 Ovarian cancer (OD06283- 

03) 


0.3 


Normal Breast GENPAK 061019 


2.9 


98865 Ovanan cancer 
NAT/fallopian tube (OD06283-07) 


0.3 


84877 Breast Cancer rOD04566^ 


2.0 


Ovarian Cancer GENPAK 064008 


0.6 


Breast Cancer Res. Gen. 1024 


8.5 "■■ 


97773 Ovarian cancer (OD06145) 


0.2 


85975 Breast Cancer rOD04590- 

01) 


39.2 


97775 Ovanan cancer NAT 
(OD06145) 


0.8 


85976 Breast Cancer Mets 
fOD04590-03^ 


22.1 


98853 Ovanan cancer (OD06455- 
03) 


3.5 


87070 Breast Cancer Metastasis 
rOD04655-05^ 


100.0 


98854 Ovanan NAT (OD06455- 
07) Fallopian tube 


0.3 


GENPAK Breast Cancer 064006 


5.7 


Normal L\mg GENPAIC 061010 


0.2 ] 


Breast Cancer Clontech 9 1 00266 


'3:6 


92337 Invasive poor diff, lung 
adeno (ODO4945-01 


1.3 1 


ireastNAT OnritRrVi 01^09*%^ 


• D.J 


92338 Lung NAT (0004945-03) 


1 

0.4 / 


breast Cancer INVITROGEN 
^09073 


1.6 


54136 Ltme Malignant Cancer 


... 1 
6.9 I 


Breast NAT INVITROGEN . 




rOD03126^ 


\2090734 


5.5 


84137 LuiipNAT rOD03126^ 


0.0 £ 


)7763 Breast cancer f OD06083) 


5.0 


90372 Lune Cancer (OD05014A) 


4.6 £ 


)7764 Breast cancer node 1 


11.7 
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metastasis (OD06083): ' ; , _ 




90373 Lung NAT (OD05014B) 


0.7 


Normal Liver GENPAK 061009 


. 2.5 


97761 Lung cancer (OD0608 1) 


LO 


RNA 1026 


0.8 


97762 Liin? cancer NAT 
(OD06081) 


0.0 


RNA 1025 


2.7 


85950 Lune Cancer raD04237-01'> 


0.7 


jrailwU. 1-/1VCX V.^d.LLL'Cl J. laoUC/ 

Research Genetics RNA 6004-T 


2.0 ' 


85970 Lune NAT <'OD04237-02^ 


0.3 


x^aireu i^vvL x issue xvesearcn 
Genetics RNA 6004-N 


2.0 


83255 Ocular Mel Met to Liver 
rODO4310^ 


3.7 


x^aucu j^ivcr v^oiicer x issue 
Research Genetics RNA 6005-T 


1.1 


83256 Liver NAT rOD043 10^ 


0.5 


Paired Liver Tissue Research 
Genetics RNA 6005-N 


0.5 > 


84139 Melanoma Mets to Lime 
fOD0432n 


0.0 


Liver Cancer GENPAK 064003 


0.4 


84138 Lime NAT rOD0432n 


0.5 


Normal Bladder GENPAK 061001 


1.4 


Normal Kidney GENPAK 061008 


0.8 


oiaaaer v^ancer xesearcn oenencs 
RNA 1023 


0.5 


83786 Kidnev Ca Nuclear ffrade 1 
rOD04338^ 


9.6 


ciaaaer \^ancer xxn vxiivwoxun 
A302173 


0.9 


83787 Kidnev NAT COD04338V 


0.3 


Nomial Stomach GENPAK 
061017 


0.6 


83788 Kidnev Ca Nuclear grade 
1/2 fOD04339^ 


0.0 


Gastric Cancer Clontech 9060397 


2.0 


83789 Kidnev NAT ('OD04339^ 


0.9 


NAT Stomach Clontech 9060396 


0.6 


83790 Kidnev Ca. Clear cell tvoe 
rOD04340V 


0.0 


Gastric Cancer Clontech 9060395 


0.2 


83791 Kidnev NAT rOD04340^ 


1.2 


NAT Stomach Clontech 9060394 


2.3 


83792 Kidnev Ca. Nuclear |jrade 3 
rOD04348^ ; 


LI 


Gastric Cancer GENPAK 064005 


1.4 



Table 38. Panel 4D

Tissue Name | Relative Expression(%) 4dx4tm4455t_ag2056_a1 | Relative Expression(%) 4dx4tm4982t_ag2056_a1
93768 Secondary Th1 anti-CD28/anti-CD3 | 0.0 | 0.0
93769 Secondary Th2 anti-CD28/anti-CD3 | 0.3 | 0.5
93770 Secondary Tr1 anti-CD28/anti-CD3 | 0.0 | 0.0
93573 Secondary Th1 resting day 4-6 in IL-2 | 0.0 | 0.0
93572 Secondary Th2 resting day 4-6 in IL-2 | 0.0 | 0.0
93571 Secondary Tr1 resting day 4-6 in IL-2 | 0.0 | 0.0
93568 primary Th1 anti-CD28/anti-CD3 | 0.4 | 0.0
93569 primary Th2 anti-CD28/anti-CD3 | 0.0 | 0.0
93570 primary Tr1 anti-CD28/anti-CD3 | 0.4 | 0.0
93565 primary Th1 resting dy 4-6 in IL-2 | 0.0 | 0.0
93566 primary Th2 resting dy 4-6 in IL-2 | 0.0 | 0.0
93567 primary Tr1 resting dy 4-6 in IL-2 | 0.0 | 0.0
93351 CD45RA CD4 lymphocyte anti-CD28/anti-CD3 | 0.0 | 0.0
93352 CD45RO CD4 lymphocyte anti-CD28/anti-CD3 | 0.0 | 0.4
93251 CD8 Lymphocytes anti-CD28/anti-CD3 | 0.0 | 0.0
93353 chronic CD8 Lymphocytes 2ry resting dy 4-6 in IL-2 | 0.0 | 0.0
93574 chronic CD8 Lymphocytes 2ry activated CD3/CD28 | 0.3 | 0.7
93354 CD4 none | 0.0 | 0.0
93252 Secondary Th1/Th2/Tr1 anti-CD95 CH11 | 0.0 | 0.0
93103 LAK cells resting | 0.4 | 1.7
93788 LAK cells IL-2 | 0.0 | 0.0
93787 LAK cells IL-2+IL-12 | 0.3 | 0.0
93789 LAK cells IL-2+IFN gamma | 0.4 | 0.0
93790 LAK cells IL-2+IL-18 | 0.0 | 0.0
93104 LAK cells PMA/ionomycin and IL-18 | 0.2 | 0.3
93578 NK Cells IL-2 resting | 0.0 | 0.0
93109 Mixed Lymphocyte Reaction Two Way MLR | 0.3 | 0.0
93110 Mixed Lymphocyte Reaction Two Way MLR | 0.5 | 1.7
93111 Mixed Lymphocyte Reaction Two Way MLR | 0.4 | 0.6
93112 Mononuclear Cells (PBMCs) resting | 0.4 | 0.0
93113 Mononuclear Cells (PBMCs) PWM | 0.0 | 0.0
93114 Mononuclear Cells (PBMCs) PHA-L | 0.0 | 0.0
93249 Ramos (B cell) none | 0.0 | 0.0
93250 Ramos (B cell) ionomycin | 0.0 | 0.0
93349 B lymphocytes PWM | 0.0 | 0.0
93350 B lymphocytes CD40L and IL-4 | 0.0 | 0.0
92665 EOL-1 (Eosinophil) dbcAMP differentiated | 0.0 | 0.0
93248 EOL-1 (Eosinophil) dbcAMP/PMAionomycin | 0.0 | 0.0
93356 Dendritic Cells none | 0.0 | 0.6
93355 Dendritic Cells LPS 100 ng/ml | 0.3 | 0.7
93775 Dendritic Cells anti-CD40 | 0.0 | 0.0
93774 Monocytes resting | 0.0 | 0.0
93776 Monocytes LPS 50 ng/ml | 0.0 | 0.0
93581 Macrophages resting | 1.1 | 0.6
93582 Macrophages LPS 100 ng/ml | 4.8 | 4.4
93098 HUVEC (Endothelial) none | 0.0 | 0.0
93099 HUVEC (Endothelial) starved | 0.0 | 0.7
93100 HUVEC (Endothelial) IL-1b | 0.0 | 0.0
93779 HUVEC (Endothelial) IFN gamma | 0.0 | 0.5
93102 HUVEC (Endothelial) TNF alpha + IFN gamma | 0.0 | 0.0
93101 HUVEC (Endothelial) TNF alpha + IL4 | 0.0 | 0.0
93781 HUVEC (Endothelial) IL-11 | 0.8 | 0.5
93583 Lung Microvascular Endothelial Cells none | 0.0 | 0.0
93584 Lung Microvascular Endothelial Cells TNFa (4 ng/ml) and IL1b (1 ng/ml) | 0.0 | 0.0
92662 Microvascular Dermal endothelium none | 0.7 | 0.0
92663 Microvascular Dermal endothelium TNFa (4 ng/ml) and IL1b (1 ng/ml) | 0.4 | 0.0
93773 Bronchial epithelium TNFa (4 ng/ml) and IL1b (1 ng/ml) ** | 4.3 | 44.0
93347 Small Airway Epithelium none | 1.6 | 6.1
93348 Small Airway Epithelium TNFa (4 ng/ml) and IL1b (1 ng/ml) | 17.8 | 18.0
92668 Coronery Artery SMC resting | 0.0 | 1.3
92669 Coronery Artery SMC TNFa (4 ng/ml) and IL1b (1 ng/ml) | 1.0 | 1.1
93107 astrocytes resting | 0.0 | 0.0
93108 astrocytes TNFa (4 ng/ml) and IL1b (1 ng/ml) | 0.4 | 0.0
92666 KU-812 (Basophil) resting | 0.8 | 0.4
92667 KU-812 (Basophil) PMA/ionomycin | 0.4 | 1.1
93579 CCD1106 (Keratinocytes) none | 6.4 | 9.7
93580 CCD1106 (Keratinocytes) TNFa and IFNg ** | 0.0 | 13.3
93791 Liver Cirrhosis | 0.1 | 1.4
93792 Lupus Kidney | 1.6 | 1.7
93577 NCI-H292 | 37.5 | 35.2
93358 NCI-H292 IL-4 | 19.8 | 19.1
93360 NCI-H292 IL-9 | 31.1 | 31.3
93359 NCI-H292 IL-13 | 8.6 | 9.7
93357 NCI-H292 IFN gamma | 9.6 | 10.7
93777 HPAEC - | 0.0 | 0.0
93778 HPAEC IL-1 beta/TNA alpha | 0.0 | 0.0
93254 Normal Human Lung Fibroblast none | 33.5 | 41.6
93253 Normal Human Lung Fibroblast TNFa (4 ng/ml) and IL-1b (1 ng/ml) | 16.3 | 25.0
93257 Normal Human Lung Fibroblast IL-4 | 77.1 | 77.1
93256 Normal Human Lung Fibroblast IL-9 | 59.5 | 68.4
93255 Normal Human Lung Fibroblast IL-13 | 51.1 | 69.2
93258 Normal Human Lung Fibroblast IFN gamma | 100.0 | 100.0
93106 Dermal Fibroblasts CCD1070 resting | 0.0 | 1.4
93361 Dermal Fibroblasts CCD1070 TNF alpha 4 ng/ml | 0.0 | 0.0
93105 Dermal Fibroblasts CCD1070 IL-1 beta 1 ng/ml | 0.0 | 1.4
93772 dermal fibroblast IFN gamma | 8.7 | 8.4
93771 dermal fibroblast IL-4 | 16.4 | 25.3
93260 IBD Colitis 2 | 0.8 | 0.0
93261 IBD Crohns | 2.1 | 0.0
735010 Colon normal | 11.9 | 13.2
735019 Lung none | 8.1 | 5.2
64028-1 Thymus none | 16.6 | 20.6
64030-1 Kidney none | 0.9 | 1.2
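
The cytokine response of normal human lung fibroblasts in Table 38 can be summarized as a fold change over the untreated sample. This is a hedged Python sketch: averaging the two runs and dividing by the untreated mean is an illustrative choice, not a method stated in the text.

```python
from statistics import mean

# Relative expression (%) from the two Panel 4D runs in Table 38.
LUNG_FIBROBLAST = {
    "none": (33.5, 41.6),
    "IL-4": (77.1, 77.1),
    "IL-9": (59.5, 68.4),
    "IL-13": (51.1, 69.2),
    "IFN gamma": (100.0, 100.0),
}

baseline = mean(LUNG_FIBROBLAST["none"])
fold = {
    treatment: mean(runs) / baseline
    for treatment, runs in LUNG_FIBROBLAST.items()
    if treatment != "none"
}

# List treatments from strongest to weakest induction.
for treatment, value in sorted(fold.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{treatment}: {value:.2f}-fold over untreated")
```

Every listed cytokine raises expression above the untreated level, with IFN gamma giving the largest increase in both runs.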



Table 39 . Panel CNS 1 



Tissue Name 


Relative 
Expression(%) 




Relative 
Expression(%) 


cnslx4tm6169t_ 
ag2056 a2 


cnslx4tm6169t 
ag2056 a2 


102633 BA4 Control 


17.6 


102605 BA17PSP 


31.8 


102641 BA4Control2 


36.2 


102612 BA17PSP2 


7.4 


102625 BA4 Alzheimer»s2 


15.0 


102637 Sub Nigra Control 


9.2 


102649 BA4 Parkinson's 


72.0 ^ 


102645 Sub Nigra Contfol2 


17.2 


102656_BA4 Parkinson's2 


93.7 


102629 Sub Nigra Alzheimer's! 


4.8 
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102664_BA4 Huntmgton's 


28.3 


102660 Sub Nigra Parkinson's2 


13.6 


102671 BA4Huntmgton's2 


11.5 


102667 Sub Nigra Huntington's 


15.1 


102603 BA4PSP 


12.3 


102674 Sub Nigra Huntiiigton's2 


14.4 


102610 BA4PSP2 


22.4 


102614 Sub Nigra PSP2 


0.9 


102588 _BA4 Depression 


213 


102592 Sub Nigra Depression 


0.3 


102596 BA4 Depressioii2 


9.2 


102599 Sub Nigra Depression2 


2.9 


102634 BA7 Control 




vjrioD x^aiiaaus tw^onuOi 




102642 BA7 Control2 




x\}Z.\y\H OlOD l^aiiaQuS V^OnuOlZ 


/.J 


102626 BA7 Alzheimer's2 


12.0 


102620 Glob Palladus Alzheimer's 


. 2.1 


102650 BA7 ParldTi<3on'^ 




102628_GlobPa]ladus 

rvizneuucFSZ 


Q 1 
O.l 


1026S7 "BA? ParldwftTi'O 


^0 7 


1 uzoDz olOD rauaaus Jrarkinson s 


72.2 


102665 _BA7 Huntmgton's 


58.6 


102659_Glob Palladus 
Parkinson's2 


8.6 


102672_BA7 Himtington's2 


55.5 . 


1 02606. Glob Palladus PSP . 


0.8 


102604 BA7PSP 


38.8 


102613 Glob Palladus PSP2 


5.3 . 


102611 BA7PSP2 


18.4 


102591 Glob Palladus Depression 


1.1.- 


102589 BA7 Depression 


13.5 


102638 .Temp Pole Control 


10.8 


102632 BA9 Control 


33.7 


102646 Tenjp Pole Control2 


34.8 


102640 BA9 Control2 


83.4 


102622 Temp Pole Alzheimer's 


7.8 


102617 BA9 Alzheimer's 


4.7 


102630 Temp Pole Alzheinier's2 


7.9 


102624 BA9 Al2heimer»s2 


30.0 


102653 Temp Pole Parkinson's 


43.0 


102648 BA9 Parkinson's 


68.7 


102661 Temp Pole Parkinson's2 


51.6 


102655 BA9 Parkmson*s2 


51.0 


102668 Temp Pole Huntington's 


46.9 


102663 B A9 HimtinRton's 


50.7 


102607 Temp Pole PSP 


11.8 


102670 BA9 Huntmgton's2 


25.1 


102615 Temp Pole PSP2 


12.9 


102602 BA9 PSP 


23.7 


102600 Temp Pole Depression2 


14.1 


102609 BA9PSP2 


6.3 


102639 Cing Gyr Control 


56.7 


102587 BA9 Depression 


9.5 


102647 Cing Gyr Control2 


63.0 


102595 BA9 Depression2 


17.0 


102623 Cing Gyr Alzheimer's 


11.5 


102635 BA17 Control 


63.0 


102631 Cing Gyr Alzheimer's2 


12.7 


102643 BA17Conlrol2 


61^8 


1026S4 Cing Gyr Parkinson's 


25.6 


102627 BA17 Al2heinier's2 


14.8 


102662 Cing Gyr Parkinson's2 


19.8 


102651 BAl 7 Parkinson's 


64.3 


102669 Cing Gyr Huntington's 


35.6 


102658 BA17 Parkinson's2 


100.0 


102676 Cing Gyr Huntington's2 


17.3 


102666 BAl 7 Huntington's 


53.9 


102608 Cing Gyr PSP 


12:4 


102673 BA17Huntinfiton's2 


22.0 


102616 Cing Gyr PSP2 


3.9 


102590. BA17 Depression 


9.8 


102594 Cing Gyr Depression 


10.5 


102597 BA17 Depressi0n2 ( 


40.5 


102601 Cing Gyr Depression2 


13.9 



Table 40. Panel CNS_neurodegeneration_v1.0

Tissue Name | Relative Expression(%) tm7005t_ag2056_b1_s2 | Tissue Name | Relative Expression(%) tm7005t_ag2056_b1_s2
AD 1 Hippo | 9.7 | Control (Path) 3 Temporal Ctx | 2.6
AD 2 Hippo | 19.2 | Control (Path) 4 Temporal Ctx | 31.2
AD 3 Hippo | 3.8 | AD 1 Occipital Ctx | 9.8
AD 4 Hippo | 6.1 | AD 2 Occipital Ctx (Missing) | 0.0
AD 5 hippo | 100.0 | AD 3 Occipital Ctx | 3.6
AD 6 Hippo | 26.9 | AD 4 Occipital Ctx | 12.5
Control 2 Hippo | 14.3 | AD 5 Occipital Ctx | 11.1
Control 4 Hippo | 4.7 | AD 6 Occipital Ctx | 27.6
Control (Path) 3 Hippo | 3.2 | Control 1 Occipital Ctx | 1.4
AD 1 Temporal Ctx | 6.6 | Control 2 Occipital Ctx | 44.9
AD 2 Temporal Ctx | 23.5 | Control 3 Occipital Ctx | 19.6
AD 3 Temporal Ctx | 9.0 | Control 4 Occipital Ctx | 3.6
AD 4 Temporal Ctx | 19.4 | Control (Path) 1 Occipital Ctx | 62.4
AD 5 Inf Temporal Ctx | 74.9 | Control (Path) 2 Occipital Ctx | 16.9
AD 5 Sup Temporal Ctx | 38.2 | Control (Path) 3 Occipital Ctx | 1.4
AD 6 Inf Temporal Ctx | 28.5 | Control (Path) 4 Occipital Ctx | 24.9
AD 6 Sup Temporal Ctx | 26.6 | Control 1 Parietal Ctx | 4.8
Control 1 Temporal Ctx | 3.7 | Control 2 Parietal Ctx | 29.0
Control 2 Temporal Ctx | 24.9 | Control 3 Parietal Ctx | 16.5
Control 3 Temporal Ctx | 10.4 | Control (Path) 1 Parietal Ctx | 54.5
Control 4 Temporal Ctx | 9.7 | Control (Path) 2 Parietal Ctx | 14.2
Control (Path) 1 Temporal Ctx | 37.8 | Control (Path) 3 Parietal Ctx | 2.3
Control (Path) 2 Temporal Ctx | 27.2 | Control (Path) 4 Parietal Ctx | 45.9



Panel 1.3D Summary The NOV6 gene is a homolog of synaptotagmin, and shows
moderate to high expression across all brain regions with highest expression in the cerebral
cortex (CT = 27.6). Synaptotagmin is a presynaptic protein involved in synaptic vesicle release,
making this an ideal drug target for diseases such as epilepsy, in which reduction of
neurotransmission is beneficial. Selective inhibition of this gene or its protein product may
therefore be useful in the treatment of seizure disorders. Furthermore, selective inhibition of
neural transmission through antagonism of the protein encoded by the NOV6 gene may show
therapeutic benefit in psychiatric diseases where it is believed that inappropriate neural
connections have been established, such as schizophrenia and bipolar disorder. In addition,
antibodies against synaptotagmin may cause Lambert-Eaton myasthenic syndrome. Therefore,
peptide fragments of the protein encoded by the NOV6 gene may serve to block the action of
these antibodies and treat Lambert-Eaton myasthenic syndrome.

The NOV6 gene also shows low but significant expression in many metabolic tissues
including adipose, adult and fetal heart, adult and fetal liver, pancreas, and the adrenal,
pituitary and thyroid glands. This gene product appears to be expressed at much higher levels
in fetal skeletal muscle (CT value = 31) when compared to adult skeletal muscle (CT value =
37), and may be useful for the differentiation of the adult from the fetal phenotype in this
tissue.
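
The "Relative Expression(%)" columns can be related to the CT values quoted in these summaries. A common convention, assumed here rather than stated in this section, is to assign 100 to the sample with the lowest CT and scale every other sample by 2 raised to the negative CT difference. The Python sketch below applies that convention; the function name is illustrative.

```python
def relative_expression(ct: float, ct_min: float) -> float:
    """Percent expression relative to the sample with the lowest CT.

    Assumes perfect doubling per cycle, so a sample that needs
    (ct - ct_min) extra cycles holds 2**-(ct - ct_min) as much template.
    """
    return 100.0 * 2.0 ** -(ct - ct_min)


CT_CEREBRAL_CORTEX = 27.6  # highest-expressing Panel 1.3D sample per the text

for tissue, ct in (("fetal skeletal muscle", 31.0), ("adult skeletal muscle", 37.0)):
    print(f"{tissue}: {relative_expression(ct, CT_CEREBRAL_CORTEX):.1f}%")
```

With the rounded CT values from the text this gives about 9.5% and 0.1%, close to the 9.4 and 0.1 listed for fetal and adult skeletal muscle in Table 36.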

The NOV6 gene is significantly expressed in a cluster of cell lines derived from lung,
gastric, colon and ovarian cancer compared to the normal tissues. The expression of this gene
also shows an association with some normal brain and prostate samples when compared to the
cell lines derived from cancers of these tissues. Thus, based upon its profile, the expression of
this gene could be of use as a marker or as a therapeutic for lung, gastric, colon and ovarian
cancers. In addition, therapeutic modulation of the product of this gene, through the use of
peptides, antibodies, chimeric molecules or small molecule drugs, may be useful in the
treatment of these cancers.

Panel 2.2 Summary Expression of the NOV6 gene is highest in a breast cancer
metastasis (CT=27.8) and appears to be highly expressed in samples derived from breast
cancer when compared to normal adjacent tissue. The expression of this gene also shows an
association with some normal kidney, prostate and lung samples when compared to the
matched kidney, prostate and lung cancer tissue. Thus, based upon its profile,
absence/presence of expression of this gene could be of use as a marker for breast, kidney,
prostate and lung cancer. Therapeutic modulation of the product of this gene, through the use
of peptides, antibodies, chimeric molecules or small molecule drugs, may be useful in the
therapy of lung, kidney, prostate and breast cancers.
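
One simple way to read the paired tumor and normal adjacent tissue (NAT) entries of Table 37 is as a log2 ratio per matched pair. The sketch below is illustrative only: it uses three pairs whose values are legible in the table, and the pseudocount that guards against division by zero is an arbitrary assumption.

```python
import math

# (tumor %, NAT %) relative-expression pairs as read from Table 37.
PAIRS = {
    "kidney (98938 / 98939)": (1.3, 0.8),
    "prostate (84140 / 84141)": (14.1, 21.1),
    "lung (90372 / 90373)": (4.6, 0.7),
}

PSEUDOCOUNT = 0.05  # arbitrary guard for samples reported as 0.0


def log2_ratio(tumor: float, nat: float, pseudo: float = PSEUDOCOUNT) -> float:
    """log2(tumor / NAT); positive means higher expression in the tumor."""
    return math.log2((tumor + pseudo) / (nat + pseudo))


ratios = {name: log2_ratio(t, n) for name, (t, n) in PAIRS.items()}
for name, value in ratios.items():
    direction = "higher in tumor" if value > 0 else "higher in NAT"
    print(f"{name}: log2 ratio {value:+.2f} ({direction})")
```

A per-pair ratio of this kind makes it easy to see that the direction of the difference varies between tissues and between individual pairs.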

Panel 4D Summary Results from two experiments with the same probe and primer set
show that the NOV6 gene is selectively expressed, at moderate levels, in lung related tissues.
Expression of the gene is found on normal human lung fibroblast and is up regulated in these
cells following treatment with IFNg, IL4, IL13 and IL-9, with highest expression in IFNg
treated cells (CTs=30). The protein encoded by the NOV6 gene is also up regulated in small
airway epithelium treated with TNF-a and IL-1b and downregulated in the muco-epidermoid
cell line H292 upon treatment with IL-13 and IFNg. The NOV6 gene is a homolog of
synaptotagmin, whose ubiquitously expressed isoform, synaptotagmin VII, regulates
exocytosis of lysosomes. Synaptotagmin VII has recently been implicated in fibroblast plasma
membrane repair along with lysosomes, which act as Ca(2+)-regulated exocytic compartments
responsible for the plasma membrane repair. Therefore, therapeutic modulation of the
expression or function of this gene or gene product, through the use of antibodies or small
molecule drugs, might be beneficial for treating lung diseases such as asthma, emphysema,
and viral and bacterial lung infection associated with cellular stress due to the local production
of inflammatory cytokines.

Panel CNS_1 Summary Highest expression of the NOV6 gene is seen in the brain of
a patient with Parkinson's disease (CT=29.6). Please see Panel 1.3D for a discussion of
potential utility in the central nervous system.

Panel CNS_neurodegeneration_v1.0 Summary Expression of the NOV6 gene is
ubiquitous throughout the samples in this panel, with highest expression in the hippocampus
of a patient with Alzheimer's disease (CT=25.8). While no association between the expression
of this gene and the presence of Alzheimer's disease is detected in this panel, these results
confirm the expression of this gene in areas that degenerate in Alzheimer's disease, including
the cortex, hippocampus, amygdala and thalamus. Synaptotagmin expression is altered in the
brain of Alzheimer's patients, possibly explaining impaired synaptogenesis and/or
synaptosomal loss secondary to neuronal loss observed in the neurodegenerative disorder. It
may also represent, reflect or account for the impaired neuronal transmission in Alzheimer's
disease (AD), caused by deterioration of the exocytic machinery. Since the NOV6 gene is a
homolog of synaptotagmin, agents that potentiate the expression or function of the protein
encoded by the NOV6 gene may be useful in the treatment of Alzheimer's disease. (Reddy et
al., Plasma membrane repair is mediated by Ca(2+)-regulated exocytosis of lysosomes. Cell
106:157-69, 2001; Takamori et al., Antibodies to calcium channel and synaptotagmin in
Lambert-Eaton myasthenic syndrome. Am J Med Sci. 319:204-8, 2000; Sze et al., Selective
regional loss of exocytotic presynaptic vesicle proteins in Alzheimer's disease brains. J Neurol
Sci. 175:81-90, 2000; Sokolov et al., Levels of mRNAs encoding synaptic vesicle and synaptic
plasma membrane proteins in the temporal cortex of elderly schizophrenic patients. Biol
Psychiatry. 48:184-96, 2000; Masliah et al., Altered expression of synaptic proteins occurs
early during progression of Alzheimer's disease. Neurology 56:127-9, 2001; Yoo et al.,
Synaptosomal proteins, beta-soluble N-ethylmaleimide-sensitive factor attachment protein
(beta-SNAP), gamma-SNAP and synaptotagmin I in brain of patients with Down syndrome
and Alzheimer's disease. Dement Geriatr Cogn Disord. 12:219-25, 2001).

NOV8: Glypican 2 Precursor-like

Expression of the NOV8a gene (134913441_EXT) and variants NOV8b (CG50970-02)
and NOV8c (CG50970-03) was assessed using the primer-probe sets Ag1309 and Ag2251
described in Tables 41 and 42. Results from RTQ-PCR runs are shown in Tables 43, 44, 45,
and 46.

Table 41. Probe Name Ag1309

Primers   Sequences                                  TM     Length   Start Position   SEQ ID NO:
Forward   5'-ACTCTCTGACCCAGCTCTTCTC-3'               59.3   22       412              126
Probe     TET-5'-CCACTCCTACGGCCGCCTGTATG-3'-TAMRA    70.6   23       434              127
Reverse   5'-GAGAACAGGCCATTGAATATGA-3'               59     22       469              128



Table 42. Probe Name Ag2251

Primers   Sequences                                  TM     Length   Start Position   SEQ ID NO:
Forward   5'-ACTCTCTGACCCAGCTCTTCTC-3'               59.3   22       359              129
Probe     TET-5'-CCACTCCTACGGCCGCCTGTATG-3'-TAMRA    70.6   23       381              130
Reverse   5'-GAGAACAGGCCATTGAATATGA-3'               59     22       416              131
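The coordinates in Tables 41 and 42 can be cross-checked mechanically. The following is a minimal sketch and not part of the original disclosure: the `Oligo` type and all function names are hypothetical, and it assumes that Start Position is a coordinate on the target sequence and that the reverse primer's Start Position marks the leftmost base it covers on the same numbering. Under those assumptions it checks that each printed Length matches the sequence, and derives an approximate amplicon span.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Oligo:
    role: str      # "Forward", "Probe" or "Reverse"
    sequence: str  # 5'->3' bases as printed, without dye labels
    length: int    # Length column as printed
    start: int     # Start Position column as printed


# Values transcribed from Table 41 (Ag1309) and Table 42 (Ag2251).
AG1309 = [
    Oligo("Forward", "ACTCTCTGACCCAGCTCTTCTC", 22, 412),
    Oligo("Probe", "CCACTCCTACGGCCGCCTGTATG", 23, 434),
    Oligo("Reverse", "GAGAACAGGCCATTGAATATGA", 22, 469),
]
AG2251 = [
    Oligo("Forward", "ACTCTCTGACCCAGCTCTTCTC", 22, 359),
    Oligo("Probe", "CCACTCCTACGGCCGCCTGTATG", 23, 381),
    Oligo("Reverse", "GAGAACAGGCCATTGAATATGA", 22, 416),
]


def length_mismatches(oligos):
    """Return the roles whose printed length differs from the sequence length."""
    return [o.role for o in oligos if len(o.sequence) != o.length]


def amplicon_span(oligos):
    """Span from the first forward-primer base to the last reverse-primer base.

    Assumption: both start positions are leftmost coordinates on one numbering.
    """
    by_role = {o.role: o for o in oligos}
    fwd, rev = by_role["Forward"], by_role["Reverse"]
    return rev.start + rev.length - fwd.start


def start_offsets(set_a, set_b):
    """Per-role difference in start position between two primer-probe sets."""
    b_by_role = {o.role: o for o in set_b}
    return {o.role: o.start - b_by_role[o.role].start for o in set_a}


if __name__ == "__main__":
    for name, oligos in (("Ag1309", AG1309), ("Ag2251", AG2251)):
        print(name, length_mismatches(oligos), amplicon_span(oligos))
    print(start_offsets(AG1309, AG2251))
```

Because the two sets share identical sequences, a constant offset between their start positions is what one would expect if they were designed against differently numbered variants of the same target; the sketch only reports that offset and does not establish the reason for it.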



Table 43. Panel 1.3D

Tissue Name                           Relative        Tissue Name                         Relative
                                      Expression(%)                                       Expression(%)
                                      1.3dtm4197t_                                        1.3dtm4197t_
                                      ag2251                                              ag2251
Liver adenocarcinoma                  0.9             Kidney (fetal)                      1.9
Pancreas                              0.4             Renal ca. 786-0                     1.0
Pancreatic ca. CAPAN 2                0.4             Renal ca. A498                      4.5
Adrenal gland                         0.6             Renal ca. RXF 393                   0.0
Thyroid                               0.4             Renal ca. ACHN                      0.3
Salivary gland                        1.2             Renal ca. UO-31                     2.8
Pituitary gland                       0.7             Renal ca. TK-10                     3.8
Brain (fetal)                         73.7            Liver                               0.0
Brain (whole)                         4.6             Liver (fetal)                       1.7
Brain (amygdala)                      6.4             Liver ca. (hepatoblast) HepG2       1.8
Brain (cerebellum)                    1.8             Lung                                0.0
Brain (hippocampus)                   22.2            Lung (fetal)                        3.1
Brain (substantia nigra)              2.1             Lung ca. (small cell) LX-1          4.5
Brain (thalamus)                      4.5             Lung ca. (small cell) NCI-H69       8.7
Cerebral Cortex                       3.5             Lung ca. (s.cell var.) SHP-77       25.7
Spinal cord                           3.2             Lung ca. (large cell) NCI-H460      2.5
CNS ca. (glio/astro) U87-MG           4.3             Lung ca. (non-sm. cell) A549        2.8
CNS ca. (glio/astro) U-118-MG         2.2             Lung ca. (non-s.cell) NCI-H23       12.4
CNS ca. (astro) SW1783                14.3            Lung ca. (non-s.cell) HOP-62        1.7
CNS ca.* (neuro; met) SK-N-AS         100.0           Lung ca. (non-s.cl) NCI-H522        28.1
CNS ca. (astro) SF-539                0.5             Lung ca. (squam.) SW 900            2.1
CNS ca. (astro) SNB-75                13.0            Lung ca. (squam.) NCI-H596          0.7
CNS ca. (glio) SNB-19                 14.7            Mammary gland                       1.0
CNS ca. (glio) U251                   3.6             Breast ca.* (pl. effusion) MCF-7    4.0
CNS ca. (glio) SF-295                 3.6             Breast ca.* (pl.ef) MDA-MB-231      1.1
Heart (fetal)                         3.4             Breast ca.* (pl. effusion) T47D     1.1
Heart                                 0.0             Breast ca. BT-549                   16.3
Fetal Skeletal                        15.2            Breast ca. MDA-N                    6.4
Skeletal muscle                       0.0             Ovary                               3.2
Bone marrow                           1.8             Ovarian ca. OVCAR-3                 1.7
Thymus                                21.2            Ovarian ca. OVCAR-4                 6.8
Spleen                                0.8             Ovarian ca. OVCAR-5                 2.3
Lymph node                            1.1             Ovarian ca. OVCAR-8                 7.3
Colorectal                            0.8             Ovarian ca. IGROV-1                 2.4
Stomach                               0.6             Ovarian ca.* (ascites) SK-OV-3      0.6
Small intestine                       2.6             Uterus                              0.8
Colon ca. SW480                       2.5             Placenta                            0.8
Colon ca.* (SW480 met) SW620          1.5             Prostate                            1.1
Colon ca. HT29                        1.7             Prostate ca.* (bone met) PC-3       3.2
Colon ca. HCT-116                     2.4             Testis                              69.7
Colon ca. CaCo-2                      2.5             Melanoma Hs688(A).T                 0.0
83219 CC Well to Mod Diff (OD03866)   2.2             Melanoma* (met) Hs688(B).T          0.0
Colon ca. HCC-2998                    2.0             Melanoma UACC-62                    0.4
Gastric ca.* (liver met) NCI-N87      0.8             Melanoma M14                        2.6
Bladder                               1.0             Melanoma LOX IMVI                   0.7
Trachea                               1.8             Melanoma* (met) SK-MEL-5            5.6
Kidney                                0.7             Adipose                             0.0



Table 44. Panel 2D



Tissue Name


Relative
Expression(%)
2dtm4198t_ag2251


Tissue Name


Relative
Expression(%)
2dtm4198t_ag2251


Normal Colon GENPAK 061003 


5.5 


Kidney NAT Clontech 8120608 


0.0 


83219 CC Well to Mod Diff 
(OD03866) 


. 4.5 


Kidney Cancer Clontech 8120613 


6.0 


83220 CC NAT rODO:^Rfifi^ 


2.6 


Kidney NAT Clontech 8120614 


0.6 


83221 CC Gr.2 rectosiemnirf 
rOD03868) 


... 1.2 


Kidney Cancer Clontech 9010320 


: 0.0 


83222 CC NAT rODO.^R6«^ 


1.1 


Kidney NAT Clontech 90 1 032 1 


1.3 


83235 CC Mod Diff ronmQ7m 


5.8 


Normal Uterus GENPAK 061018 


1.1 


83236 CC NAT rODO3920^ 


2.3 


Uteras Cancer GENPAK 06401 1 


3.0 


83237 CC Gr.2 ascend rnlnn 
fOD03921) 


4.1 


Normal Thyroid Clontech A+ 
6570-1 


0.6 


83238 CCNAT (0D0^9?.n 


0.0 


Thyroid Cancer GENPAK 064010 


0.6 


83241 CC from Partial 
Henatectomv (ODO4300) 


1.3 


Thyroid Cancer INVITROGEN 
A302152 


0.4 


83242 Liver NAT (0D04^nQ^ 


0.0 


Thyroid NAT INVITROGEN 

A302153 


2.3 


87472 Colon mets to Irnip 
rOD04451-0n . 


4.3 


Nformal Breast GENPAK 061019 


4.4 


87473 Lnnp NAT rOD044SlU)9^ 


0.0 \ 


34877 Breast Cancer rOD04566^ 


1.2 


Nomial Prostate Clontech A+ 
6546-1 


\ 

0.0 ( 


35975 Breast Cancer rOD04590- 

m 


100.0 


84140 Prostate Cancer rOD044] 0^ | 


3.4 . { 


85976 Breast Cancer Mets
(OD04590-03)


1.5




84141 Prostate NAT fOD04410^ 


0.0 


87070 Breast Cancer Metastasis 
fOD04655-05> 


3.7 


87073 Prostate Cancer fOD04720- 

on 


0.6 


GENPAK Breast Cancer 064006 


6,8 


87074 Prostate NAT f OD04720- 
021 


1.8 


Breast Cancer Res. Gen. 1024 


10.4 


Normal Lung GENPAK 061010 


5.1 


Breast Cancer Clontech 9100266 


6.6 


8*^239 Lunp Met to Muscle 
raD042861 


0.0 


Breast NAT Clontech 9100265 


3.4 


83240 Muscle NAT fOD04286^ 


0.6 


Breast Cancer INVITROGEN 
A209073 


7,9 


84136 Lung Malignant Cancer 
rOD03126^ 


3,9 


Breast NAT INVITROGEN 
A2090734 


2.5 


84137 Lune NAT fOD03126> 


0,0 


Normal Liver GENPAK 061009 


0.0 ' 


84871 Lune Cancer rOD04404^ 


0.0 


Liver Cancer GENPAK 064003 


0.6 


84872 Luns NAT rOD04404^ 


0.6 


Liver Cancer Research Genetics > 
RNA 1025 


6.0 


84875 Lune Cancer rOD04565> 


0.6 


Liver Cancer Research Genetics 
RNA 1026 


0.6 


84876 Lung NAT rOD045651 


0.0 


Paired Liver Cancer Tissue 
Research Genetics RNA 6004-T 


0.0 


85950 Lune Cancer raD04237.0n 


99.3 


Paired Liver Tissue Research 
Genetics RNA 6004-N 


0.6 


85970 Lung NAT ^0004237-02^ 


2.4 


Paired Liver Cancer Tissue 
Research Genetics RNA 6005-T 


1.1 


83255 Ocular Mel Met to Liver 
rODO4310^ 


0.7 


Paired Liver Tissue Research 
Genetics RNA 6005-N 


0.0 


83256 Liver NAT r6DO4310^ 


0.0 


Normal Bladder GENPAK 061001 


1.8 




18.0 


Bladder Cancer Research Genetics 

"D XT A 1 rvo 1 


2.8 \ 


84138 Lune NAT rOD0432n 


0.6 


Diaacier cancer UN vi i kuCjiIN 
A302173 


13.2 V 


Normal Kidney GENPAK 061008 


1.4 


87071 Bladder Cancer rOD04718- 

on 


0.0 


83786 Kidnev Ca. Nuclear grade 2 


8.0 


87072 Bladder Normal Adiacent 


1.3 


rOD04338'^ 


fOD04718-03> 


83787 Kidnev NAT rOD04338^ 


0.0 


Normal Ovary Res. Gen. 


2.8 


1/2 (0004339^ 


2.4 


Ovarian Cancer GENPAK 064008 


.4.3 


83789 Kidnev NAT COD04339^ 


0,0 


5 /492 Uvarv Cancer (OD04768- 
071 


'4.0 


33790 Kidnev Ca Clear cell tvnp 
rOD04340^ 


1.2 


87493 OvarvNATrOD04768-08'» 


0.0 


83791 Kidnev NAT f OD04340^ 


1.0 


Normal Stomach GENPAK 
061017 


0.8 


83792 Kidnev Ca. Nuclear erade 3 
fOD04348^ 


0.0 


Gastric Cancer Clontech 9060358 


0.3 


83793 Kidnev NAT raD04348^ 


0.8 


NAT Stomach Clontech 9060359 


1.2 


87474 Kidnev Cancer raD04622. 

on 


1.1 . 


Gastric Cancer Clontech 9060395 


0.0 




0.0 . 


NAT Stomach Clontech 9060394 


1.5 


85973 Kidnev Cancer fOD04450- 

on 




Gastric Cancer Clontech 9060397 


6.8 : 


85974 Kidnev NAT roiD04450-0^^ 


0.6 ] 


^JAT Stomach Clontech 9060396 


0.0 


Kidney Cancer Clontech 8120607 


0.6 < 


Gastric Cancer GENPAK 064005 


2.5


Table 45. Panel 4D



Tissue Name


Relative
Expression(%)
4dtm4199t_ag2251


Relative
Expression(%)
4Dtm1886t_ag1309

93768 Secondary Thl anti-CD28/anti-CD3 


1.6 


1.5 


93769_Secondary Th2 anti-CD28/anti-CD3 


IJ. 


1.0 


93770 Secondary Trl anti-CD28/anti-CD3 


1.7 


2.0 


93573 Secondary Thl resting day 4-6 in IL-2 


0.5 


1.7- . 


93572 Secondary Th2 resting day 4-6 in IL-2 


0.6 


1.4 


93571 Secondary Trl resting day 4-6 in IL-2 


1.2 


1.4 


93568 primary Thl anti-CD28/anti-CD3 


2.7 


1.7 


93569 primary Th2 anti-CD28/anti-CD3 


1.9 


3.4 


93570 primary Trl anti-CD28/anti-CD3 


1.2 


5.9 


93565 primary Thl resting dy 4r6 in IL-2 


17.1 


12.5 


93566 primary Th2 resting dy 4-6 in IL-2 


8.6 


6.5 ' 


93567 primary Trl resting dy 4-6 in IL-2 


2.4 


. .3.7 . 


93351 GD45RA CD4 lymphocyte anti-CD28/anti-CD3 


1.2 


3.2 


93352 CD45RO CD4 lymphocyte anti-CD28/anti-CD3 


2.7 


4.3 


93251 CDS Lymphocytes anti-CD28/anti-CD3 


1.9 


1.1 


93353 chronic CDS Lymphocytes 2ry resting dy 4-6 in IL-2 


0.9 


1.7 


93574 chronic CD8 Lymphocytes 2ry activated CD3/CD28 


1.0 . 


1.1 


93354 CD4 none 


0.5 


1.4 


93252 Secondary Thl/Th2/Trl anti-CD95 CHll 


1.7 


4.5 


93103 LAK cells restmg 


1.6 


1.2 


93788 LAK cells IL-2 


1.8 


3.1 


93787 LAK cells IL-2-ML-12 


0.7 


1.8 


93789 LAK cells IL-2+IFN gamma 


13 


1.7 


93790 LAK cells IL-2+IL-18 


1.4 


1.5 


93104 LAK cells PMA/ionomycm andIL-I8 


, 0.0 


0.8 


93578 NK Cells IL-2 restmg 


1.1 


0.9 


93109 Mixed Lymphocyte Reaction Two Way MLR 


2.3 


1.6 


931 10 Mixed Lymphocyte Reaction Two Way MLR 


0.3 


1.7. 


93 1 1 1_ Mixed Lymphocyte Reaction Two Way MLR 


0.4 


0.8 


931 12 Mononuclear Cells (PBMCs) resting 


0.0 


0.4 


93113 Mononuclear Cells (PBMCs) PWM 


2.1 


6.1 . 


93114 Mononuclear Cells (PBMCs) PHA-L 


5.7, 


9.9 


93249 Ramos (B cell) none 


6.2 


13.6 


93250 Rarnn<? ^B cell^ innnmvrin 


24.3 


34.9 


93349 B lymphocytes PWM 


7.3 


7.4 - 


93350 B lymphoytes CD40L and IL-4 


4.4 


■ 2.7 


92665 EOL-1 (Eosinophil) dbcAMP differentiated 


2.3 


2.6 


93248 EOL-1 (Eosinophil) dbcAMP/PMAionomycin 


1.3 


0.3 ' \ 


93356 Dendritic Cells none . , 


0.8 


0.5


93355 Dendritic Cells LPS 100 ng/ml


1 0.0 


1 0.0 


93775 Dendritic Cells anti-CD40 


0.0 


1 0.3 ' 


93774 Monocytes resting 


1 0.0 


1 0.3 


93776 Monocytes LPS 50 ng/nol 


1 0.3 


1 


93581 Macrophages resting 


1 1.3 


j 0.6 


93582_ Macrophages LPS lOOng/ml 


1 0.0 


1 0.0 


93098 HUVEC (Endothelial) none 


1 


1 5.3 


93099 HUVEC (Endothelial) starved 


1 12.4 


1 12.7 


93100 HUVEC (Endothelial) IL-lb 


1 1.6 


1 1.3 


93779 HUVEC (Endothelial) DFN gamma 


I 2.5 


1 2.9 


93 102 HUVEC (Endothelial) TNF alpha + IFN gamma 


1 0.1 


1 ^'^ 


^oiKJi jiu vhnw- (jcnaoLueiiaij i JNJr aipna + IL4 


1 2.6 


1 3.5 


yj / ox xi u V (^Jtinciouieiiai) JLL- 1 1 


1 ^'^ 


1 4.4 


93583 Lung Microvascular Endothelial Cells none 


1 2.3 . 


1 


93584_Lung Microvascular Endothelial Cells TNFa (4 ne/ml) and 


1.7 


2.1 


7ZQD^ j\i2crovascuiar i^ennaj enaotaeliuTn none 


1 2.6 


6.1 


92663 Microsvasular Dermal endothelium TNPn (4 naAnn nnH tt iVi 
(Ing/ml) 


1.3 


1 2.0 


93773 Bronchial epithelium TNFa (4 ng/ml) and ILlb (1 ng/ml) ♦* 


1 1.6 


1 2.9 


93347 Small Airway Epitheliimi none 


1 0.4 


1 0.8 


93348 Small Airway Epitiielium TNFa (4 ng/ml) and ILlb (1 np/mn 


1 


1 3.1 


92668 Coronery Artery SMC resting 


1.1 • 


1 ■ 13 


92669 Coronery Artery SMC TNFa (4 ng/ml) and ILlb (1 ne/mn 


2.0 


1.4 ' 


93107 astrocytes resting 


22.5 


17.8 


9310S astrocytes TNFa (4 ng/ml) and ILlb (1 n^ml) 


4.7 


6.2 


92666 KU-812 (Basophil) resting | 


0.2 1 


0.3 


92667 KU-812 (Basophil) PMA/ionoycin | 


0.3 1 




93579 CC:D1 106 (Keratinocytes) none | 


3.9 j 


3 0- 


93580 CCDl 106 (Keratinocytes) TNFa and IFNg ** j 


3.1 1 


1.7. <j 


93791 Liver Cirrhosis 


2.6 1 




93792 Lupus Kidnev 


0.0 ^' 1 




93577 NCI-H292 ■ *. | 


0.4 1 


0 7 


93358 NCI-H292 IL^ 


0.4 1 


1 7 


93360 NCI-H292 IL-9 


1.6 I 


0 0 


93359 1^CI''H292 IL-13 


1.6 1 


v. V/ 

0.6 


93357 Na-H292 IFN gamma 


0.3 1 


0.0 


93777 HPAEC - 


2.0 


3.3 


93778 HPAEC TI^l beta/TTWA nlnVio 1 


0.6 


1.6 


932^4 'N^rirmnl TTiimoTi T tinrf 17-iV%v«^l«i»..4. 1 

^•j^'j'r i>iuiiijai xxuinan j_jUiig pioroDlast none 1 


3.4 


3.7 


93253 Normal Human Lung Fibroblast TNFa (A r\trlmV\ ^-nA rr ik n 
ng/ml) 


1,5 


1/5 


93257 Normal Human Lung Fibroblast IL-4 


2.8 


3.6 


93256 Normal Human Lung Fibroblast IL-9 


3.2 


2.6 


93255_ Normal Human Lung Fibroblast IL-13 ' 


2.8 


2.7 


93258 -Nomial Human Lung Fibroblast IFT^ gamma 


1.9 - 


0.5 


93106 Demial Fibroblasts CCD1070 resting 


3.7 1 


4.2


93361 Dermal Fibroblasts CCD1070 TNF alpha 4 ng/ml


.4.2 . 


. 2.4 


93105 Dermal Fibroblasts CCD1070 IL-1 beta 1 ng/ml 


2.5 


. ^ 1.3 


93772 dermal fibroblast IFN gamma 


0.2 


0.7 


93771 dermal fibroblast IL-4 


0.8 


0.7 


93260 IBD Colitis 2 


0.0 


0.2 


93261 IBDCrohns 


0.0 


0.0 


735010 Colon normal 


4.2 


3.1 


735019 Lung none 


1.3 


1.5 


64028-1 Thymus none 


0.3 


1.6 


64030-1 Kidney none 


100.0 


100.0 



Table 46. Panel CNS_neurodegeneration_v1.0

Tissue Name                  Relative               Tissue Name                  Relative
                             Expression(%)                                       Expression(%)
                             tm6901t_ag2251_a2s2                                 tm6901t_ag2251_a2s2
106655 4951 Hippo            19.7                   106677 4624 BA21             3.4
106657 4986 Hippo            35.8                   106681 4640 BA21             34.1
106652 4933 Hippo            7.2                    106654 4951 BA17             19.5
106649 4901 Hippo            9.7                    cns water                    0.0
110138 3087 hippo            59.0                   106651 4933 BA17             17.3
110121 3027 Hippo            97.7                   106648 4901 BA17             19.1
106670 "RwT^r.               37.0                   110123 3027 Occ Ctx          52.8
106666 4867 Hippo            34.8                   110140 3087 occ ctx          41.9
106680 4624 Hippo            17.2                   106659 4595 BA17             6.3
106653 4951 BA21             20.5                   106668 4971 BA17             51.5
106656 4986 BA21             33.7                   106662 4737 BA17             22.9
106650 4933 BA21             9.8                    106665 4867 BA17             6.6
106647 4901 BA21             36.0                   106675 3975 BA17             73.6
110136 3087 inf temp ctx     76.8                   106672 3954 BA17             16.7
110137 3087 sup temp ctx     97.7                   106678 4624 BA17             11.8
110118 3027 Inf Temp Ctx     59.8                   106682 4640 BA17             28.1
110119 3027 Sup Temp Ctx     100.0                  106660 4595 BA7              12.0
106658 4595 BA21             9.8                    113670 106669 pool           62.2
106667 4971 BA21             29.9                   106663 4737 BA7              20.3
106661 4737 BA21             10.5                   106676 3975 BA7              43.7
106664 4867 BA21             34.1                   106673 3954 BA7              14.9
106674 3975 BA21             63.8                   106679 4624 BA7              7.8
106671 3954 BA21             13.7                   106683 4640 BA7              29.8



Panel 1.3D Summary Ag2251 The highest level of expression of the NOV8 gene is
seen in a CNS cancer cell line SK-N-AS (CT=29.6). The gene is also expressed at higher
levels in cell lines derived from lung, prostate, and breast cancers compared to the normal
tissues and may play a role in these cancers. Thus, expression of the NOV8 gene could be used
as a marker or as a therapeutic for lung, prostate and breast cancer. In addition, therapeutic
modulation of the activity of the product of this gene, through the use of peptides, antibodies,
chimeric molecules or small molecule drugs, may be useful in the treatment of these cancers.

The NOV8 gene is also expressed at higher levels in fetal liver, lung, skeletal muscle,
and heart (CTs=32-35) when compared to the expression in adult tissues (CTs=40). These
results suggest that expression of the NOV8 gene could potentially be used to distinguish
between the adult and fetal phenotypes of these tissues. Furthermore, the difference in
expression in fetal and adult tissue may also indicate an involvement of the gene product in the
differentiation processes leading to the formation of the adult organs. Therefore, the protein
encoded by the NOV8 gene could potentially play a role in the regeneration of these tissues in
the adult.
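To give a sense of scale for the CT differences quoted above, the following is a minimal sketch, not taken from the original disclosure. It assumes ideal amplification (template doubling each cycle, `efficiency=2.0`) and reads the adult value as CT=40; the function names are hypothetical, and the normalization actually used to produce the Relative Expression(%) columns may include steps not shown here.

```python
def fold_difference(ct_lower_expression, ct_higher_expression, efficiency=2.0):
    """Approximate fold difference in template between two samples.

    Assumption: each PCR cycle multiplies the product by `efficiency`,
    so a sample that crosses threshold n cycles earlier started with
    efficiency**n times more template.
    """
    return efficiency ** (ct_lower_expression - ct_higher_expression)


def relative_percent(cts, efficiency=2.0):
    """Scale a list of CT values so the lowest CT (highest expression) is 100%."""
    best = min(cts)
    return [100.0 * efficiency ** (best - ct) for ct in cts]


if __name__ == "__main__":
    # Fetal tissues at CT 32-35 versus adult tissues read as CT 40.
    print(fold_difference(40, 35))         # weakest fetal signal vs adult
    print(fold_difference(40, 32))         # strongest fetal signal vs adult
    print(relative_percent([30, 31, 32]))  # illustrative CT values only
```

Under these assumptions a gap of 5 to 8 cycles corresponds to roughly a 32-fold to 256-fold difference in starting template, which is why a shift from CT 32-35 to CT 40 is treated as a change from detectable to essentially absent expression.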

The NOV8 gene, a glypican homolog, is expressed at moderate to low levels across
many regions of the brain. These regions include the hippocampus, amygdala, thalamus and
cerebral cortex, all of which are key regions subject to Alzheimer's disease neurodegeneration.
Furthermore, glypican is expressed in senile plaques and neurofibrillary tangles, also
indicating a role in Alzheimer's disease. Therefore, the expression profile of the NOV8 gene
suggests that antibodies against the protein encoded by the NOV8 gene can be used to
distinguish neurodegenerative disease in the human brain. Furthermore, since NOV8
gene-product-like substances are components of senile plaques which are thought to give rise to the
dementia pathology of Alzheimer's disease, agents that target this gene and disrupt its role in
senile plaques may have utility in treating the cause and symptoms of Alzheimer's disease as
well as other neurodegenerative diseases that involve this glypican.

Panel 2D Summary Ag2251 The highest expression of the NOV8 gene is seen in a breast
cancer sample (CT = 30.3). The expression of this gene appears to show an association with
samples derived from colon, lung, kidney, breast, bladder and gastric cancers when compared
to the matched normal tissue. Thus, expression of the NOV8 gene could be used as a marker
for these cancers. In addition, therapeutic modulation of the activity of the product of this
gene, through the use of peptides, antibodies, chimeric molecules or small molecule drugs,
may be useful in the treatment of colon, lung, kidney, breast, bladder and gastric cancers.
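The comparison of cancer samples with matched normal adjacent tissue (NAT) can be made explicit with a small calculation. The sketch below is illustrative only: the values are transcribed from the legible rows of Table 44, the pairing of Clontech samples by adjacent catalogue numbers is an assumption, and the pseudocount of 0.1 (added so that 0.0 readings do not cause division by zero) is an arbitrary choice, not a method described in the source.

```python
# (label, tumor %, NAT %) transcribed from Table 44 (Ag2251, Panel 2D).
# Assumption: samples with adjacent catalogue numbers form a matched pair.
PAIRS = [
    ("Kidney, Clontech 8120613 vs 8120614", 6.0, 0.6),
    ("Kidney, Clontech 9010320 vs 9010321", 0.0, 1.3),
    ("Lung, 85950 vs 85970", 99.3, 2.4),
    ("Breast, Clontech 9100266 vs 9100265", 6.6, 3.4),
    ("Gastric, Clontech 9060397 vs 9060396", 6.8, 0.0),
]


def tumor_to_nat_ratio(tumor, nat, pseudocount=0.1):
    """Ratio of tumor to NAT relative expression, stabilised by a pseudocount."""
    return (tumor + pseudocount) / (nat + pseudocount)


def pairs_with_higher_tumor(pairs):
    """Labels of the pairs in which the tumor sample exceeds its NAT sample."""
    return [label for label, tumor, nat in pairs if tumor > nat]


if __name__ == "__main__":
    for label, tumor, nat in PAIRS:
        print(f"{label}: ratio {tumor_to_nat_ratio(tumor, nat):.1f}")
    print(len(pairs_with_higher_tumor(PAIRS)), "of", len(PAIRS), "pairs higher in tumor")
```

Not every pair points the same way (one kidney pair is higher in NAT), which is consistent with the hedged wording "appears to show an association" in the summary above.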

Panel 4D Summary Ag2251/Ag1309 Two experiments using two different probe and
primer sets produce results that are in very good agreement, with highest expression seen in
the kidney (CTs=28-29). This high level of expression in the kidney suggests that expression
of the NOV8 gene can serve as a marker for kidney tissue. The NOV8 gene is also expressed
at low levels in the activated Ramos B cell line, in activated primary B cells, Th1 T cells, activated
HUVEC and keratinocytes. The NOV8 gene encodes a protein that is a homolog of a
glypican molecule, which belongs to the family of HSPG (heparan sulfate proteoglycans).
Glypicans can regulate the activity of a wide variety of growth and survival factors. Therefore,
therapeutic modulation of the expression or function of this gene or gene product, through the
use of antibody drugs, could potentially prevent T and B cell activation in the treatment of
autoimmune mediated diseases such as insulin-dependent diabetes mellitus, rheumatoid
arthritis, Crohn's disease, allergies, delayed type hypersensitivity, asthma, and psoriasis.
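The statement that the two probe and primer sets agree can be quantified with a correlation coefficient. The following sketch is an illustration rather than the authors' analysis: it uses a handful of rows from Table 45 where both values are legible, takes the first value in each row as Ag2251 and the second as Ag1309 (following the column header order), and implements a plain Pearson correlation; the variable and function names are hypothetical.

```python
import math

# (sample, Ag2251 %, Ag1309 %) transcribed from legible rows of Table 45.
ROWS = [
    ("Secondary Th1 anti-CD28/anti-CD3", 1.6, 1.5),
    ("Secondary Tr1 anti-CD28/anti-CD3", 1.7, 2.0),
    ("Primary Th1 resting", 17.1, 12.5),
    ("Primary Th2 resting", 8.6, 6.5),
    ("Ramos (B cell) none", 6.2, 13.6),
    ("Ramos (B cell) ionomycin", 24.3, 34.9),
    ("Astrocytes resting", 22.5, 17.8),
    ("Kidney", 100.0, 100.0),
]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


AG2251 = [row[1] for row in ROWS]
AG1309 = [row[2] for row in ROWS]

if __name__ == "__main__":
    print(f"Pearson r = {pearson(AG2251, AG1309):.3f}")
```

Because both runs are normalized to kidney at 100%, that single point dominates the coefficient; repeating the calculation without it, or on log-transformed values, would be a fairer test of agreement among the low-expressing samples.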

Panel CNS_neurodegeneration_v1.0 Summary Ag2251 Highest expression of the
NOV8 gene in this panel is detected in the cerebral cortex of an Alzheimer's patient
(CT=32.7). While no association between the expression of this gene and the presence of
Alzheimer's disease is detected in this panel, these results confirm the expression of this gene
in areas that degenerate in Alzheimer's disease. Please see Panel 1.3D for a discussion of
potential utility of this gene in the central nervous system. (Verbeek et al., Agrin is a major
heparan sulfate proteoglycan accumulating in Alzheimer's disease brain. Am J Pathol.
155:2115-25, 1999).

NOV9: Mitogen-Activated Protein Kinase Kinase 2-like

Expression of the NOV9 gene (also referred to as AC011005_da2) was assessed using the
primer-probe set Ag2022 described in Table 47. Results from RTQ-PCR runs are shown in
Tables 48, 49, and 50.

Table 47. Probe Name Ag2022

Primers   Sequences                                  TM     Length   Start Position   SEQ ID NO:
Forward   5'-CCAGGAGTTTGTCAATAAATGC-3'               58.6   22       800              132
Probe     FAM-5'-CTCATCAAGAACCCAGCGGAGCG-3'-TAMRA    71.2   23       822              133
Reverse   5'-TTGATGAAGGTGTGGTTTGTG-3'                59.5   21       863              134



Table 48. Panel 1.3D



Tissue Name


Relative Expression(%)


1.3dx4tm5437f_ag2022_b1


1.3dx4tm5441f_ag2022_a1


Liver adenocarcinoma 


23.1 


15.9 


Pancreas 


9.6 


4.3 


Pancreatic ca. CAPAN 2 


4.1 


. 4.4 


Adrenal gland 


7.9 


10.3 


Thyroid 


12.1 


9.7. 


Salivary gland 


10.9 


5.9 


Pituitary gland 


. 12.0 


9.6


Brain (fetal)


13.6 


7.6 


Brain (whole) 


47.3 


25.1 


Brain (amygdala) 


33.7 


19.9 


Brain (cerebellum) 


33.2 


16.3 


Brain (hippocampus) 


42.8 


21.7 


Brain (substantia nigra) 


30.6 


13.8 


Brain (thalamus) 


50.3 


24.6 


Cerebral Cortex 


36.5 


31.4 


Spinal cord 


16.9 


8.7 


CNS ca. (glio/astro) U87-MG 


17.6 


18.5 


CNS ca. (glio/astro) U-1 18-MG 


54.6 


38,5 


CNS ca. (astro) SW1783 


13.5 


12.8 


CNS ca.* (neuro; met ) SK-N-AS 


15.4 


12 9 


CNS ca. (astro) SF-539 . 


14.3 


9.5 


CNS ca. (astro) SNB-75 


29.6 


25.8 


CNS ca. (glio) SNB-19 


23 7 


17 Q 


CNS ca..(glio)U251 


38.4 


34 5 


CNS ca, (glio) SF-295 




17 9 


Heart (fetal) 


17 1 


JIO.D 


Heart 




in ^ 


Fetal Skeletal 




19 0 


Skeletal muscle 


inn A 


inn n 


Bone marrow 


15 0 


14 s 


Thymus 


7 6 


R 1 


Spleen 


14 5 


11/1 

1 I .*T 


Lymph node 


0^ 7 


ICO 


Colorectal 


^ 7 


A 7 


Stonoach 


J.*T.*T 


1 n 1 

lU.l 


Small intestine 






Colon ca. SW480 






Colon ca.* (SW480 met)SW620 




4.1 ■ 


Colon ca. HT29 


1 A 




Colon ca.HCT-1 16 




O 1 ■ 

7.1 


Colon ca. CaCo-2 


5 6 


7 1 


83219 CC Well to Mod Diff rOD0386^n 


1 1 R 

JL l.O 


1 1 < 
1 l.D 


Colon ca. HCC-2998 






Gastric ca.* (liver met) NCI-N87 


13 1 




Bladder 




4 9 


Trachea 


13.7 




Kidney 


14 6 


o.*f 


Kidney (fetal) 


9.2 


4.2 


Renal ca. 786-0 


9.5 


7.3 


Renal ca. A498 


23.2 . 


. 19.4 


Renal ca. RXF 393 


16.9 . 


16.0 


Renal ca. ACHN 


14.4 


10.5


Renal ca. UO-31


11.2 


SI 


Renal ca. TK-10 


5,4 


1 *T.O 1 


Liver 


11.2 


1 ^.T- 1 


Liver (fetal) 


24 1 


1 ft . 

I O.O * 1 


Liver ca, (hepatoblast) HepG2 


12 R 


1 inn 


Lung 




1 1 1 o 

1 1 X .7 1 


Lung (fetal) 


1 1 R 


1 Q O 1 


Lung ca. (small cell) LX-1 


1^ 2l 


1 Q /I ! 


Lung ca. (small cell) NCI-H69 


IS 5? 


1 1 7 n 1 
1 / .u j 


Lung ca. (s.ceU var.) SHP-77 


1 J..Z 


1 1 o i 


Lung ca. (large cell)NCI-H460 




1 oo >1 1 


Lung ca. (non-sm. cell) A549 


S 7 
J. / 


0.2 


Lung ca; (non-sxell) NCI-H23 




1 n n \ 


Lung ca (non-s.cell) HOP-62 




1 1 0 A 1 
IZ.U ' 1 


Lung ca. (non-s,cl) NCI.H522 


J. / 


4.0 


Lung ca. (squam.) SW 900 




i A i 1 

4.1 


Lung ca. (squam.) NCI-H596 




1 1 1 1 • ■ I 
11.1 


Mammary gland 


7 7 




Breast ca.* (pi. effusion) MCF-7 




A 1 1 


Breast ca.* (pLef) MDA-MB-231 




1 56.9 


Breast ca.* (pi. effiision) T47D 


A 7 




Breast ca. BT-549 




20.7 . .. 1 


Breast ca. MDA-N 


O.JJ 


6.2 


Ovary 




6.5 


Ovarian ca. OVCAR-3 




5,6 


Ovarian ca. OVCAR-4 


7Q A 


20.7 


Ovarian ca. OVCAR-5 




7.0 


Ovarian ca. OVCAR-8 


11.7 


9.8 * 


Ovarian ca. IGROV-1 


1 < J 


2.3 


Ovarian ca.* (ascites) SK-OV-3 


oi 1 
1 


17.0 


Uterus -* ■ 


o 1 


18.7 


Placenta 


1 n 0 1 


6.3 . \ 


Prostate 


in ^ 1 


10.5 


Prostate ca.* (bone met)PC-3 


9n ^ 1 




Testis 


77 7 


18.1 I 
19.5 • 


Melanoma Hs688fA).T 


o.o 1 


5.1 


Melanoma* (met) Hs688n3).T 


in 7 1 
lU. / 1 


8.5 


Melanoma UACC-62 


43.5 


36.3 . j 


MelanontiaM14 


42.1. 


. 37.0 . • 


Melanoma LOX IMVI 


7.9 


9.1 


Melanoma* (met) SK-MEL-5 


16.0 1 


14.2 


Adipose 


4.8 1 


3.8


Table 49. Panel 2.2



Tissue Name


Relative
Expression(%)
2.2x4tm6395f_ag2022_b1


Tissue Name


Relative
Expression(%)
2.2x4tm6395f_ag2022_b1


Normal Colon GENPAK 061003 


24.5 


83793 Kidnev NAT rOD04348) 


70.6 


97759 Colon cancer (OD06064) 


14.3 


98938 Kidney malignant cancer 
(OD06204B) 


22.0 


97760 Colon cancer NAT 
(OD06064) 


18.0 


98939 Kidney normal adjacent 
tissue (OD06204E) 


18.7 


97778 Colon cancer fOD06159) 


12.2 


85973 Kidnev Cancer rOD04450- 

m 


48.2 


97779 Colon cancer NAT 
(OD06159) 


18.1 


85974 Kidnev NAT rOD04450-031 


16.6 


98861 Colon cancer (OD06297-04) 


15.5 


Kidney Cancer Clontech 8120613 


14.1 


98862 Colon cancer NAT 
(OD06297.015) 


23.7 


Kidney NAT Clontech 8120614 


40.5 


83237 CC Gr.2 ascend colon 

{KJDKjjyZl) 


25.7 


Kidney Cancer Clontech 9010320 


20.5 


83238 CC NAT fOD03921^ 


21.9 


Kidney NAT Qontech 9010321 


12.9 


97766 Colon cancer metastasis 
(OD06104) 


6.0 


Kidney Cancer Clontech 8120607 


48.0 


97767 Lung NAT (OD06104) 


19,5 


Kidney NAT Clontech 8 120608 


37.8 


87472 Colon mets to lung 
(OD04451-01) 


30.2 


Normal Uterus GENPAK 061018 


11.9 \. 


87473 Lunc NAT rOD04451.02^ 


11.4 


Uterus Cancer GENPAK 06401 1 


16.7 


Normal Prostate Qontech A+ 
6546-1 (8090438) 


21.9 


Nonnal Thyroid Clontech A-f 
6570-1 (7080817) 


9.6 


84140 Prostate Cancer rOD04410) 


9.1 


Thyroid Cancer GENPAK 064010 


19.0 


84141 Prostate NAT rOD04410) 


11.2 


Thvroid Cancer INVITROGEN 
A302152 


28.3 


Normal Ovary Res. Gen. 


58.3 


Thvroid NAT INVITROGEN 
A302153 


11.8 


98863 Ovarian cancer (OD06283- 

03) ^ 


12.9 


Normal Breast GENPAK 061019 


12.3 


98865 Ovarian cancer 
NAT/fallopian tube (OD06283-07) 


8.5 


84877 Breast Cancer rOD04566^ 


12.9' * 


Ovarian Cancer GENPAK 064008 


15.8 


Breast Cancer Res. Gen. 1024 


* 23.1 


97773 Ovarian cancer (OD()6 145) 


19.1 


85975 Breast Cancer /'OD04S00- 

m 


45.8 ' 


97775 Ovarian cancer NAT 
(OD06145) 


34.4 


85976 Breast Cancer Met*; 
fOD04590-03^ 


35.7 ' 


98853 Ovarian cancer (OD06455- 
03) 


11.0 


87070 Breast Cancer Metastasis 
rOD04655-05'i 


100.0 


98854 Ovarian NAT (OD06455- 
07) Fallopian tube..^ 


• 2.8 


GENPAK Breast Cancer 064006 


19.3 


Normal Lung GENPAK 061010 


11.2 


Breast Cancer Clontech 9100266 


8.3 . 


92337 Invasive poor Hiff, lung 
adeno (ODO4945-01 


19.1 ] 


Breast NAT Clontech 9 1 00265 


9.3 


92338 Lung NAT (ODO4945-03) 


] 

9.9 i 


Breast Cancer INVITROGEN 
^09073 


3.6 


84136 Lune Malignant Cancer 
rOD03126) 


: ] 

22.4 i 


Breast NAT INVITROGEN 
^090734 




84137 Lunff NAT rono^ 1 7^ 


8.6 S 


?7763 Breast cancer fOD06083) 


47.4 


90372 Lung Cancer fOD05014A) 


c 

24.5 I 


^7764 Breast cancer node 
netastasis (OD06083) 


48.8 


90373 Lung NAT fOD05014B) 


13.8 ! 


formal Liver GENPAK 061009 


29.2


97761 Lung cancer (OD06081)


12:4 


Liver Cancer Researrli' rrpnf^tioc 
RNA 1026 


-31.1 . 


97762 Lung cancer NAT 
(OD06081) 


2.6 


Liver Cancer Research Genetics 
KNA1025 


39.1 


85950 Lune Cancer rOD04237-0n 


14.4 


Paired Liver Cancer Tissue 
Research Genetics RNA 6004-T 


35.9 


85970 Lune NAT rOD04237.02^ 


29.6 


Paired Liver Tissue Research 
Genetics RNA 6004-N 


8;1 ' 


33255 Ocular Mel Met to Liver 
fODO4310^ 


44.3 


Paired Liver Cancer TiQQnp 
Research Genetics RNA 6005-T 


744 


83256 Liver NAT rOD043 1 0^ 


12.2 


Paired Liver Tissue Research 
Genetics RNA 6005-N 


74.3 


QHiDy ivieiauoma j>aets to J ..imp 
'OD0432n 


32.4 


Liver Cancer GENPAK 064003 


45,7 


84138 Lun^NAT rOD0432n 


13.2 


Nonnal Bladder GENPAK 061001 


19.3 


Normal Kidney GENPAK 061008 


14.8 


Bladder Cancer Research Genetirc 
RNA 1023 


29.9 


83786 Kidney Ca. Nuclear erade 2 
fOD04338^ 


40.8 


Bladder Cancer JNVTTROnTTM 
A302173 


30.8 ^ 


83787 KidnevNAT rOD043^R^ 


12.3 


Nonnal Stomach GENPAK 
061017 


48.5 


83788 Kidnev Ca Nuclear erade 
1/2 fOD04339^ 




Gastric Cancer Clontech 9060397 


16.4 


83789 Kidnev NAT fOD043.^0^ 


17.9 ] 


MAT Stomach Clontech 9060396 


33.1 


83790 Kidnev Ca. Clear cell tvni. 
fOD04340^ 


21.0 ( 


jastric Cancer Clontech 9060395 


14.5 


83791 Kidnev NAT ron04'^4m 


18.0 I 


N[AT Stomach Clontech 9060394 


34.4 


83792 Kidney Ca. Nuclear praH« ^ 
fOD04348^ 


16.6 ( 


jastric Cancer GENPAK 064005 


25.3 



Table 50. Panel 4D



Tissue Name


Relative
Expression(%)
4dx4tm4449f_ag2022_b1


Tissue Name


Relative
Expression(%)
4dx4tm4449f_ag2022_b1


93768_SecondaTy Thl anti- 
CD28/anti-CD3 


21.3 


93100_HUVEC (EndotheIial)_n> 


Is ^ 


93769_Secondary Th2 anti- 
CD28/anti-CD3 


16.3 


93779_HUVEC (Endothelial) IFN 

gamnia 


15.3 


93770__Secondary Trl anti- 
CD28/anti>CD3 

93573_Secondaiy Thl resting day 
4-6inIL-2 


0.0 


93102_HUVEC 

(Endothelial)_TNF alpha +.IFN 
gamma 


13.4 


5.2 


93101_HUVEC ' 

(Endothelial) TNFalnha + n^ 


13.7 


93572_Secondaiy Th2 resting day 
4-6inIL-2 1 


)93781_HUV£CCEndothehaI) IL- 
. 6.7 111 


13.7 


93571_Secondary Trl resting day 
4-6 in IL-2 


6.3 


93583_Lung Microvascular 
Endothelial Cells none 


14.8 


93568_j)riniary Thl anti- 
CD28/anti-CD3 


* 

15.6 i 


93584_Lung Microvascular 
Endothelial Cells TNFa(4ng/ml) 
md ILlb (1 ng/ml) 


18.0 


93569_priniary Th2 anti- 
CD28/anti-CD3 


12.9 c 


?2662_Microvascular Dermal * 
mdothelium none 


22.2 


93576_primary Trl anti- 
|CD28/anti-CD3 


• ■ i 
e 

20.1 - I 


)2663_Microsvasular Dermal 
;ndotheHum_TNFa (4 ng/ml) and 
Llb(lng/ml) 


18.8


93565_primary Th1 resting dy 4-6
in IL-2


23.0 


93773_BronchiaI 

epitheliimi TNFa (4 ng/ml) and 

ILlb (1 ng/ml) ** 


5.8 


. 93566_jirimaiy Th2_restmg dy 4-6 
iiiIL-2 


11.4 


93347_Small Ahrway 
Epithelium none 


6.9 


93567_j)riinary Trl jresting dy 4-6 
mIL-2 


1.6 


93348_Small Airway 
Epithelium TNFa (4 ng/ml) and 
ELlbd ng/ml) 


25.1 


93351_CD45RACD4 
lyinphocyte_anti-CD28/aiiti-CD3 


12.7 


92668_Coronery Artery 
SMC resting 


18.5 


93352_;CD45ROCD4 
lymphocyte_anti-CD28/anti-CD3 


11.4 


92669 Coronery Artery 
SMC_TNFa (4 ng/ml) and ILlb (1 
ng/ml) 


14.0 


93251_CD8 Lymphocytes^anti- 
CD28/anti-CD3 


12.4 


93107 astrocytes resting 


12.6 


93353_chromc CDS Lymphocytes 
2ry_restmg dy 4-6 in IL-2 


11.9 


93108_astrocytes__TNFa (4 ng/ml) 
andlLlbd ng/ml) 


15.4 


93574_chronic CDS Lymphocytes 
2ry_ activated CD3/CD28 


11.4 


92666_ KU-812 (Basophil) resting 


58.7 


93354 CD4 none 


3.2 


92667_KU-812 
(Basophil) PMA/ionoycin 


100.0 


93252 Secondary 
Thl/Th2/Trl anti-CD95 CHll 


9.0 


93579_CCD1106 
(Keratinocytes) none 


14.2 


ifoiKi^ i-kAJv cells resnng 


lO.o 


93580_CCD1106 
(Keratinocytes)_TNFa and IFNg 


1.7 


y J / oo JLAJv cells LL-Z 


12.8 


93791 Liver Cirrhosis 


1.6 


93787 LAK cells IL-2+IL-12 


9.3 


93792 Lupus Kidney 


1.4 


93789_LAK cellsJL-2+IFN 
gamma 


11.3 


93577 NCI-H292 


19.8 


93790 LAK cells IL-2+ IL-18 


9.3 


93358 NCI-H292 IL-4 


20.4 


93104_LAK 

cells_PMA/ionomycin and IL-18 


5.7 


93360 Na-H292 IL-9 


22.3 


93578 IsIK Cells IL-2 resting 


9.7 


93359 NCI-H292 IL-13 


12.5 


93 1 09_Mixed Lyn^hocyte 
Reaction Two Way MLR 


9.2 


93357 Na-H292 IFN gamma 


12.9 


93 11 0_Mixed Lyoqshocj^e 
Reaction Two Way MLR 


11.5 


93777 HPAEC - 


16.3 


93 1 ll_Mixed Lymphoc3^e 
Reaction Two Way MLR 


7.5 


93778_HPAEC_II^1 beta/TNA. 
alpha 


17.5 


93 1 12_Mononuclear Cells 
(PBMCs) resting 


3.9 


93254_Nonnal Human Limg 
Fibroblast none 


23 0 


93 1 13 Mononuclear Cells 
(PBMCs) PWM 


24.7 


93253__Normal Human Lung . 
Fibroblast TNFa (4 ng/ml) and IL- 
lb (1 ng/ml) 


14.3 


93114 Mononuclear CeDs 
(PBMCs) PHA-L 


12.6 


93257 Normal Human' Lung 
FibrolDlast IL-4 


30.8 


93249 Ramos (B cell) none 


13.6 


93256_Nom:ial Himian Limg 
Fibroblast IL-9 


27.0 


93250 Ramos (B cell) ionomycin 


32.2 


93255_Normal Human Lung 
Fibroblast IL-13 


19.5 


93349 B lymphocytes PWM 


■ 

49.8 


93258_Normal Human Limg 
fibroblast IFN gamma 


27.4 


93350 Blymphoytes CD40L and 
IL-4 


25.5 . i 


?3106_Dennal Fibroblasts 
CCD 1070 resting 


24.5 


92665_EOL-l . • 

(Eosinophil)_dbcAMP 

differentiated 


18.0 ( 


?3361_Demial Fibroblasts 
CCD1070 TNF alpha 4 ng/mi 


47.3 " 


93248_EOL-l 

(Eosmophil)_dbcAMP/PMAionoin 
ycin 


27.5 


yjl05_DerniaI Fibroblasts 
CCD1070 IL-1 beta 1 ng/ml 


'214 


yDDDo Jjenantic Cells none 


10.0 


93772_dennal fibroblast^IFN 
ganuna 


12.4 


93355 Dendritic Cells I 1 no 
ng/ml 


6.9 


93771 dprmal fiKTnWoe+ TT A 


22.5 


93775 Dendritic Cells anti.CD40 


8.9 


93260 IBD Colitis 2 


0.8 


yi774 Monocytes restinp 


11.1 


93261 IBD Crohns 


1.3 


93776 Monocytes UPS50nfi/inl 


7.0 


735010 Colon nomial 


13.7 


93581 Macrophages resting 


12,4 


735019 Lung none 


8.6 


93582_Macrophages LPS 100 
ng/nil 


5.8 


64028-1 Thymus none 


9.9 


93098_HUVEC 
(Endothelial) none 


22.1 


54030-1 Kidney none 


18.6 


93099_HUVEC 
(Endothelial) starved-. 


22.3 







Panel 1.3D Summary Two results using the same probe and primer set show results 
that are in excellent agreement, with highest expression of the NOV9 gene in adult skeletal 
muscle (CTs=27). The NOV9 gene also shows moderate expression in other tissues with 
metabolic function including adipose, adult and fetal heart and liver, adult skeletal muscle, 
pancreas, and the adrenal, thyroid, and pituitary glands. Expression is much lower in fetal 
skeletal muscle (CTs=30) relative to the adult tissue (CTs=27), which may implicate the 
expression of this gene in differentiation of skeletal muscle and thus suggests that expression 
of the NOV9 gene could be used to differentiate between the adult and fetal phenotypes of this 
tissue. The pathway mediated by MAP kinase kinase (MAPKK) has been shown to influence 
myoblast proliferation and both insulin and exercise stimulate signaling via this pathway in 
skeletal muscle. Insulin resistance in obese and diabetic subjects may in part be due to tumor 
necrosis factor alpha, whose effects are mediated through interference with the normal 
activation of MAPKK by insulin. In addition, exercise training significantly improves 
insulin-induced MAPKK activity in obese Zucker rats. This indicates that an activator of MAPKK 
may be an effective pharmaceutical agent in the treatment of diabetes. Furthermore, activation 
of the MAPKK pathway is involved in adipocyte differentiation from preadipocytes in 
androgen deficiency. Therefore, a MAPKK antagonist may be a suitable pharmacological 
agent in the treatment of obesity in some cases. 
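
The CT values quoted above can be read as approximate fold differences. The following is a minimal sketch, assuming ideal amplification in which the template doubles every cycle, so that a difference of n cycles corresponds to roughly a 2^n-fold difference in starting template. The function names are illustrative only and are not part of the disclosed method.

```python
def fold_difference(ct_reference: float, ct_sample: float) -> float:
    """Approximate fold excess of the reference over the sample.

    Assumes ~100% PCR efficiency (template doubles each cycle);
    a lower CT means more starting template.
    """
    return 2.0 ** (ct_sample - ct_reference)


def relative_expression_percent(ct_sample: float, ct_highest: float) -> float:
    """Express a sample as a percentage of the highest-expressing sample
    (the one with the lowest CT) on the same panel, under the same assumption."""
    return 100.0 * 2.0 ** (ct_highest - ct_sample)


# Adult skeletal muscle (CT=27) versus fetal skeletal muscle (CT=30).
print(fold_difference(27, 30))              # 8.0
print(relative_expression_percent(30, 27))  # 12.5
```

Under this assumption, the three-cycle gap between adult and fetal skeletal muscle corresponds to about an eight-fold difference in transcript level.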

The NOV9 gene is expressed at higher levels in cell lines derived from melanoma, and 
kidney and lung cancers compared to the normal tissues and may play a role in cancers in 
these tissues. Thus, the expression of this gene could be useful as a marker or as a therapeutic 
for lung and kidney cancer as well as melanomas. In addition, therapeutic modulation of the 
activity of the gene product, through the use of peptides, chimeric molecules or small 
molecule drugs, may be useful in the therapy of these cancers. 

The NOV9 gene, a homolog of Mitogen Activated Protein Kinase Kinase, is expressed 
at high to moderate levels across the brain, with highest expression in the central nervous 
system seen in the thalamus (CT=28.4). Mitogen Activated Protein Kinase Kinase is activated by 
valproic acid, a drug that is used to treat both seizure disorders and bipolar depression. Valproic acid is 
believed to work by increasing neuronal production of GABA, the major inhibitory 
neurotransmitter in the brain. Selective activation of this kinase may therefore have 
therapeutic benefit in the treatment of seizure disorders, bipolar disorder, or in any other 
neurological/psychiatric condition believed to be caused by a GABA deficit (e.g., schizophrenia). 

Panel 2.2 Summary Highest expression of the NOV9 gene in this panel is seen in a 
breast cancer sample (CT=29.0). The expression of this gene shows an association with 
samples derived from breast and kidney cancers when compared to the matched normal tissue. 
Thus, expression of the NOV9 gene could be useful as a marker for breast and kidney cancers. 
Furthermore, therapeutic modulation of the activity of the product of this gene, through the use of peptides, 
chimeric molecules or small molecule drugs, may be useful in the treatment of breast and 
kidney cancers. 

Panel 4D Summary Expression of the NOV9 gene is ubiquitous throughout this panel. 
Highest expression of this gene is found in the basophil cell line, KU-812, upon activation 
with PMA/ionomycin (CT=26.2), compared to non-activated cells. High expression of the 
NOV9 gene is also found in activated B cells, a B cell line, and dermal fibroblasts. The NOV9 
gene is homologous to a Mitogen Activated Protein Kinase Kinase 2 (MAPKK2), a serine 
threonine kinase which functions downstream of Raf in the signaling pathway that affects 
proliferation and differentiation. The high expression of this kinase in basophils suggests a 
role for this kinase in mast cell/basophil signal transduction. Activated mast cells/basophils 
have been associated with many atopic diseases, including asthma, atopic contact dermatitis, 
allergies, and rhinitis. Therefore, therapeutic modulation of the expression or function of the 
NOV9 gene product, through the use of small molecule drugs, might be beneficial in the 
treatment of these diseases. In addition, the high expression of this kinase in activated B cells 
suggests that the use of small molecule drugs designed to the NOV9 gene product could 
prevent B cell hyperproliferative disorders such as autoimmune diseases and lymphomas 
(Yuan et al., The mood stabilizer valproic acid activates mitogen-activated protein kinases and 
promotes neurite growth. J Biol Chem. 276:31674-83, 2001; Bulleit and Hsieh, MEK 
inhibitors block BDNF-dependent and -independent expression of GABA(A) receptor subunit 
mRNAs in cultured mouse cerebellar granule neurons. Brain Res Dev Brain Res. 119:1-10, 
2000; Jones et al., ERK1/2 is required for myoblast proliferation but is dispensable for muscle 
gene expression and cell fusion. J Cell Physiol. 186:104-15, 2001; Wojtaszewski et al., 
Differential regulation of MAP kinase by contraction and insulin in skeletal muscle: metabolic 
implications. Am J Physiol. 277(4 Pt 1):E724-32, 1999; Begum et al., Effect of tumor necrosis 
factor-alpha on insulin-stimulated mitogen-activated protein kinase cascade in cultured rat 
skeletal muscle cells. Eur J Biochem. 238:214-20, 1996; Osman et al., Exercise training 
increases ERK2 activity in skeletal muscle of obese Zucker rats. J Appl Physiol. 90:454-60, 
2001; Lacasa et al., Site-related specificities of the control by androgenic status of 
adipogenesis and mitogen-activated protein kinase cascade/c-fos signaling pathways in rat 
preadipocytes. Endocrinology 138:3181-6, 1997). 

NOV11: Thymosin beta 10-like 

Expression of the NOV11a gene (GMAC079400_A) and variant NOV11b 
(CG109754-01) was assessed using the primer-probe sets Ag2431 and Ag3087 described in 
Table 51. Results from RTQ-PCR runs are shown in Tables 52, 53, 54 and 55. 



Table 51. Probe Name Ag2431 

Primers   Sequences                                    TM     Length   Start Position   SEQ ID NO: 
Forward   5'-AGAAAATGGCAGACAAACCA-3'                   58.2   20       23               135 
Probe     TET-5'-AATCGCCAGCTTCAATAGGGCCAAG-3'-TAMRA    70.7   25       54               136 
Reverse   5'-GCGTCTCCGTTTTCTTCAG-3'                    58.6   19       79               137 
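
The lengths listed in Table 51 can be checked directly against the sequences. The following is a minimal sketch that recomputes each oligonucleotide's length and GC content; the dictionary layout and function names are illustrative assumptions. The TM values are not recomputed, because the melting-temperature method used to obtain them is not stated here.

```python
# Sequences and listed lengths as given in Table 51 (Ag2431 primer-probe set).
OLIGOS = {
    "Forward": ("AGAAAATGGCAGACAAACCA", 20),
    "Probe": ("AATCGCCAGCTTCAATAGGGCCAAG", 25),
    "Reverse": ("GCGTCTCCGTTTTCTTCAG", 19),
}


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an oligonucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def lengths_match(oligos: dict) -> dict:
    """Map each oligo name to whether its sequence length equals the listed length."""
    return {name: len(seq) == listed for name, (seq, listed) in oligos.items()}


for name, (seq, listed) in OLIGOS.items():
    print(f"{name}: length {len(seq)} (listed {listed}), GC {gc_percent(seq):.1f}%")
```

A mismatch between a computed and a listed length would point to a transcription error in the sequence or in the table.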



Table 52. Panel 1.3D 

Tissue Name                           Relative Expression(%)    Tissue Name                           Relative Expression(%) 
                                      1.3dtm4242t_ag2431                                              1.3dtm4242t_ag2431 
Liver adenocarcinoma                  13.7                      Kidney (fetal)                        3.6 
Pancreas                              0.9                       Renal ca. 786-0                       2.4 
Pancreatic ca. CAPAN 2                0.4                       Renal ca. A498                        3.6 
Adrenal gland                         3.0                       Renal ca. RXF 393                     0.5 
Thyroid                               0.9                       Renal ca. ACHN                        0.1 
Salivary gland                        1.2                       Renal ca. UO-31                       0.2 
Pituitary gland                       1.5                       Renal ca. TK-10                       0.3 
Brain (fetal)                         28.1                      Liver                                 1.6 
Brain (whole)                         2.2                       Liver (fetal)                         3.6 
Brain (amygdala)                      24.1                      Liver ca. (hepatoblast) HepG2         0.8 
Brain (cerebellum)                    1.5                       Lung                                  8.6 
Brain (hippocampus)                   100.0                     Lung (fetal)                          5.0 
Brain (substantia nigra)              1.9                       Lung ca. (small cell) LX-1            1.4 
Brain (thalamus)                      6.3                       Lung ca. (small cell) NCI-H69         1.1 
Cerebral Cortex                       20.6                      Lung ca. (s.cell var.) SHP-77         1.4 
Spinal cord                           1.0                       Lung ca. (large cell) NCI-H460        1.0 
CNS ca. (glio/astro) U87-MG           9.4                       Lung ca. (non-sm. cell) A549          1.7 
CNS ca. (glio/astro) U-118-MG         6.5                       Lung ca. (non-s.cell) NCI-H23         1.4 
CNS ca. (astro) SW1783                3.5                       Lung ca. (non-s.cell) HOP-62          0.4 
CNS ca.* (neuro; met) SK-N-AS         5.6                       Lung ca. (non-s.cl) NCI-H522          0.8 
CNS ca. (astro) SF-539                1.5                       Lung ca. (squam.) SW 900              1.6 
CNS ca. (astro) SNB-75                3.9                       Lung ca. (squam.) NCI-H596            0.5 
CNS ca. (glio) SNB-19                 1.8                       Mammary gland                         3.3 
CNS ca. (glio) U251                   0.6                       Breast ca.* (pl. effusion) MCF-7      3.8 
CNS ca. (glio) SF-295                 0.4                       Breast ca.* (pl.ef) MDA-MB-231        32.3 
Heart (fetal)                         3.1                       Breast ca.* (pl. effusion) T47D       0.2 
Heart                                 1.5                       Breast ca. BT-549                     25.3 
Fetal Skeletal                        6.8                       Breast ca. MDA-N                      1.0 
Skeletal muscle                       0.5                       Ovary                                 4.5 
Bone marrow                           7.9                       Ovarian ca. OVCAR-3                   1.1 
Thymus                                0.6                       Ovarian ca. OVCAR-4                   0.1 
Spleen                                2.5                       Ovarian ca. OVCAR-5                   0.6 
Lymph node                            4.4                       Ovarian ca. OVCAR-8                   1.1 
Colorectal                            3.8                       Ovarian ca. IGROV-1                   0.8 
Stomach                               1.4                       Ovarian ca.* (ascites) SK-OV-3        2.9 
Small intestine                       3.2                       Uterus                                1.5 
Colon ca. SW480                       1.1                       Placenta                              1.7 
Colon ca.* (SW480 met) SW620          0.7                       Prostate                              1.4 
Colon ca. HT29                        [illegible]               Prostate ca.* (bone met) PC-3         1.5 
Colon ca. HCT-116                     1.0                       Testis                                0.7 
Colon ca. CaCo-2                      [illegible]               Melanoma Hs688(A).T                   3.4 
83219 CC Well to Mod Diff (OD03866)   5.3                       Melanoma* (met) Hs688(B).T            1.6 
Colon ca. HCC-2998                    3.8                       Melanoma UACC-62                      0.5 
Gastric ca.* (liver met) NCI-N87      4.7                       Melanoma M14                          1.2 
Bladder                               8.4                       Melanoma LOX IMVI                     12.7 
Trachea                               3.1                       Melanoma* (met) SK-MEL-5              2.8 
Kidney                                1.2                       Adipose                               3.7 



Table 53. Panel 2D 



Tissue Name 


Relative 
Expression(%) 
2dtm4243t_ag2431 


Tissue Name 


Relative 
Expression(%) 
2dtm4243t_ag2431 


Normal Colon GENPAK 061003 


33.0 


Kidney NAT Clontech 8120608 


1.1 


83219 CC Well to Morf nifF 
rOD03866^ 


21.8 


Kidney Cancer Clontech 8120613 


1^4 


83220 CC NAT rODn^R^?^^^ . . 19,8 


Kidney NAT Qontech 8120614 


1.3 
83221 CC Gr.2 rectosigmoid 
(OD03868) 


26.6 


Kidney Cancer Clontech 901 0320 


/^3:o 


83222 CC NAT rODOSSeS'i 


2,1 


Kidney NAT aontech 9010321 


3.0 


83235 CC Mod DiffrODO3920^ 


26.1 


Normal Uterus GENPAK 061018 


1:6 ■ 


83236 CC NAT rODO^Q:)0^ 


13.9 


Uterus Cancer GENPAK 06401 1 




83237 CC Gr.2 ascend colon 
fOD0392n 


49.7 


Normal Thyroid Clontech A+ 
6570-1 




83238 CCNAT rOD0392n 


8.4 


Thyroid Cancer GENPAK 064010 


5.2 1 


83241 CC from PnTtinl 
HcDatectomv rODO4309^ 


13,9 


Thyroid Cancer INVITROGEN 
A302152 


n n * • \ 
Z. / 1 


83242 Liver NAT rODO4309^ 


3.9 


Thyroid NAT INVITROGEN 
A302153 




87472 Colon mets to lung 
fOD04451-0n 


5.6 


Normal Breast GPNP ATT C)f^^ 01 0 


•2 0 -I 

D*Z. j 


87473 Lime NAT rOD0445U02^ 


6.7 


84877 Breast Cancer rOD04566^ 


1.9 1 


Normal Prostate Clontech A+ 
6546-1 


1.4 


oiy /3 iireast Cancer iOD04590- 

01} 


' 3.4 - 


84140 Prostate Cancer VoD044 10^ 


7.4 


ojy /o lireasi v_/ancer JVLeis - • 
fOD04590-03^ 


4.7 


84141 Prostate NAT ron04410) 


6.4 


87070 Breast Cancer Metastasis 
rOD04655.05^ 


8.0 • 1 


87073 Prostate Cancer rOD04720- 

m 


3.2 


GENPAK Breast Panrer 0^4001^ 




87074 Prostate NAT rOD04720- 
02) 


6.2 


Breast Cancer Res. GeiL 1024 


4 0 


Normal Lung GENPAK 061010 


17.7 


Breast Cancer Clontech 9100266 


9.4. 


53239 Liinp Met tn MyicMo 
rOD04286) 


33.4 


Breast NAT Plnntprln 01 nn9#;^ 


0.8 


83240 Muscle NAT rOD04i286) 


3.0 


Breast Cancer INVITROGEN 
A209073 


19.2 


54136 Lunp Malip;nflnt Cancer 
rOD03126) 


7.3 


Breast NAT INVITROGEN 
A2090734 


5.0 


84137 Lune NAT rOD031 26^ 


6.6 




0.6 


84871 Lune Cancer rOD04404) 


5.0 


Liver Cancer GENPAK 064003 


1.8" 1 


84872 Lune NAT ron044n4^ 


4.9 


Liver Cancer Research Genetics 
RNA 1025 


* 1.2 


84875 Umz Cancer rOD04565^ 


4.2 


Liver Cancer Research Genetics 
RNA 1026 




84876 Lun^ NAT rOD04565^ 


1 
J 

4.2 ] 


Paired Liver Cancer Tissue 
R^esearch Genetics RNA 6004-T 


1.1 


85950 Lunp Cancer rOD04237.01) 


1 

; 

19.5 < 


Paired Liver Tissue Research . 
aenetics RNA 6004-N 


9 1. 


85970 Lunff NAT rOD04237.09.^ 


] 

8.7 1 


^aired Liver Cancer Tissue 
Research Genetics RNA fifln^.T 


'^■i- f 


83255 Ocular Mel Met to Liver 
(QD0431Q1 


2.7 ( 


^aired Liver Tissue Research 
jenetics RNA 6005-N 


1.0 1 


83256 Liver NAT rOD043 1 0^ 


2,1 . I 


formal Bladder GENPAK 061001 


* 19.6 


84139 Melanoma Mets to Lung 
rOD0432n 


I 

8.0 I 


bladder Cancer Researrli C^t>-nf»i^fc 
LNA 1023 


8.7 


84138 Lun^ NAT rOD04321 V 


I 

6.9 / 


iladder Cancer INVITROGEN . 

G02173 


16.5 


Normal Kidney GENPAK 061008 


8 

■ 2.6 0 


7071 Bladder Cancer COD04718- 

n 


20.3 - ' \ 


53786 Kidnev Ca. Nuclear prade 2 
rOD04338) 


- 8 

'7.9' . . a 


7072 Bladder Normal Adiacent 
0D04718-03^ 


ib.i ' 


83787 Kidnev NAT rOD043^R^ 


2.5 h 


formal Ovary .Res. Gen. ' ( 


- • 9.4 • :| 
83788 Kidney Ca Nuclear grade 
1/2 (OD04339) 


6.6 


Ovarian Cancer GENPAK 064008 


15.1 


83789 Kidnev NAT rOD04339^ 


1.4 


87492 Ovarv Cancer COD04768- 
07} 


7.1 


83790 Kidnev Ca. Clear cell tvoe 
fOD04340^ 


6.9 


87493 Ovarv NAT rOD04768-08^ 


3.5 


83791 Kidnev NAT fOD04340^ 


3.9 


Noimal Stomach GENPAK 
061017 


4.7 


83792 Kidnev Ca. Nuclear erade 3 
(OD04348) 


18.2 


Gastric Cancer Clontech 9060358 


2.1 


83793 Kidnev NAT rOD04348^ 


3.6 


NAT Stomach Clontech 9060359 


17.7 • 


87474 Kidnev Cancer fOD04622- 


10.8 


Gastric Cancer Clontech 9060395 


14.7 


6 iH ij jsjonev JN Ai ( uuu'Joz^-U3 ) 


1.2 


NAT Stomach Clontech 9060394 


22.7 


85973 Kidnev Cancer fOD04450- 

on 


6.0 


Gastric Cancer Clontech 9060397 


100.0 


85974 Kidn^ NAT (GDMASMtV^ 


1.9 


NAT^xomach Clontech 9060396 


14.8 


Kidney Cancer Clontech 8120607 


3.2 


Gastric Cancer GENPAK 064005 


32.3 



Table 54. Panel 4D 



Tissue Name 


Relative 
Expression(%) 
4dtm4244t_ag2431 


Tissue Name 


Relative 
Expression(%) 
4dtm4244t_ag2431 


93768_Secondary Thl_anti- 
CD28/anti-CD3 


97.9^ 


93100 HUVEC (Endothelial) IL- 
Ib 


4.8 


93769_Secondary Th2_anti- 
CD28/anti-CD3 


69.7 


93779_HUVEC.(Endothelial)_IFN 
gamma 


15.2 


93770_Secondary Trl^anti- 
CD28/anti.CD3 


100.0 


93102_HUVEC 
(Endothelial)_TNF alpha + IFN 
gamma 


11.8 


93573_Secondary Thl_resting day 
4-6inIL-2 


65.5 


93101_HUVEC 

(Endothelial) TNF alpha + IL4 


20.4 . 


93572_Secondary Th2_resting day 
4-6 in IL-2 


50.0 


93781 HUVEC (EndotheHal) IL- 
11 


14.7 . 


93571_Secondary Trl_resting day 
4-6inIL-2 


46.7 


93583_Lung Microvascular 
Endothelial Cells none 


32.1 


93568_primary Thl anti- 
CD28/anti-CD3 


92.0 


93584_Lung Microvascular 
Endothelial Cells_TNFa (4 ng/ml) 
and ELlb (1 ng/ml) 


38.2 


93569_primary Th2 anti- 
CD28/anti.CP3 


79.0 


92662_Microvascular Dermal 
endothelium none 


41.2 


93570_primary Tr1 anti- 
CD28/anti-CD3 


57.0 


92663_Microsvasular Dermal 
endothelium TNFa (4 ng/ml) and 
ILlb (1 ne/ml) 


34.9 


93565_primary Th1 resting dy 4-6 
in IL-2 


80.1 


93773_Bronchial 

epithelium TNFa (4 ng/ml) and 

ILlb (1 nfi/ml) 


51.8 


93566_primary Th2 resting dy 4-6 
inIL-2 


61.6 


93347_Small Airway 
Epitheliimi none 


18.8 


93567jprimary Trl resting dy 4-6 
inIL-2 


31.6 


93348_SmaU Airway 
Epithelium TNFa (4 ng/ml) and 
ILlbd ng/ml) - 


43.2 


93351_CD45RA CD4 
lymphocyte anti-CD28/anti-CD3 


- 

46.0 


92668_Coronery Artery 
SMC resting 


30.8 - . 


93352_CD45RO CD4 
lymphocyte_anti-CD28/anti-CD3 


40.1 


92669_Coronery Artery 
SMC_TNFa (4 ng/ml) and IL1b (1 
ng/ml) 


20.7 



93251_CD8 Lymphocytes_anti- 
CD28/anti-CD3 



29.5 



93107^astrocytes resting 



11.0 



93353_chromc CDS Lymphocytes 
2ry - resting dy 4-6 in IL-2 



33.0 



93108_astrocytes_TNFa (4 ng/ml) 
arid ILlbQ ng/ml) 



11.2 



93574_chrpnic CDS Lymphocytes 
2ry_ activated CD3/CD28 



38.4 



92666 KU.8 12 (Basophil) resting 



9.7 



93354 CD4 none 



20.0 



92667_KU-812 
(Basophil)_PMA/ionoycin 



18.9 



93252_Secondary 
ThlyTh2/Trl anti-CD95 CHI 1 



49.3 



93579_CCD1106 
(Keratinocytes) none 



46.0 



93103 LAK cells resting 



85.9 



93580_CCD1106 
(Keratiiiocytes)_TNFa and IFNg 



31.9 



93788 LAK cells IL-2 



32.3 



93791,Liver Cirrhosis 



4:9 



93787 LAK ceUs IL-2'fIL>12 



24.1 



93792 Lxrpus Kidney 



2.6 



93789_LAK cells_IL"2+IFN 



20.9 



93577 NCI-H292* 



17.8 



93790 LAK cells, IL-2+ IL-18 



14.0 



93358 NCI-H292 IL.4 



15.1 



93104_LAK 

cells PMA/iononrycin and IL-18 



36.3 



93360 Na-H292 IL-9 



173- 



93578 NK Cells IL-2 resting 



14.4 



93359 NCI-H292 IL-13 



5.0 



93109__Mixed Lynq)hocyte 
Reaction Two Way MLR 



29.7 



93357 NCI-H292 IFN gamma 



13.5 



93 1 10_Mixed Lymphocjrte 
Reaction Two Way MLR 



27.2 



93777 HPAEC 



14.3 



93 1 1 l_Mixed Lyxx^hocyte 
Reaction Two Way MLR 



34.9 



93778_HPAEC_IL-1 betamsTA 
Ipha 



30.1 



931 12_Mononuclear Cells 
(PBMCs) resting 



16.6 



93254_Nonnal Human Lung 
Fibroblast none 



39.8, 



93 1 13_Mononuclear Cells 
(PBMCs) PWM 



36.6 



93253_Normal Hmnan Lung 
Fibroblast^TNFa (4 ng/ml) and BL- 1 
lb (1 ng/ml) ■. 



38.7 



931 14_Mononuclear Cells 
(PBMCs) PHA-L 



35.4 



93257_Normal Himoan Lung 
Fibroblast IL-4 



63.7 



93249, Ramos (B cell) none 



11.8 



93256_Nonnal Human Lung 
Fibroblast IL-9 



55.5 



93250 Ramos (B cell); ionomycin 



13.7 



93255_NormaI Human Lung 
Fibroblast IL-13 



93349 B lymphocytes PWlVI 



93350_B lymphoytes_CD40L and 
IL-4 . 



25.5 



93258_Normal Human Lung 
FibroblastlFN gamma 



10.8 



93 1 06_Dennal Fibroblasts 
CCD1070 resting 



59.5 



78.5 



57.8 



92665_EOL-l 
(Eosinophil)_dbcAMP 
differentiated ^ ' 



5.1 



93361_Dermal Fibroblasts 
CCD1070 TNF alpha 4 ng/ml 



64.2 



93248_EOL-l 

(Bosinophil)_dbcAMP/PMAionom 
ycin . . ^ 



15.6 



93 105_Dermal Fibroblasts 
CCD1Q70 IL-1 bete 1 ng/ml 



27.2 



93356 Dendritic Cells none 



33.4 



93772_dermal fibroblast_IFN 
gamma ' 



33.2 



93355_Dendritic Cells_LPS 100 . 
ng/ml 



31.0 



93771 dermal fibroblast ILr4 



38.2 



93775 Dendritic CeUs anti-CD4n 



25.7 



93260 IBD Colitis 2 



4.0 



93774 Monocytes resting 



29.9 



93261 IBDCrohns 



7.9 



93776 Monocytes LPS 50 ng/ml 



14.4 


735010_Colon normal 


93581_Macrophages resting 


28.3 


735019_Lung none 


38.4 


93582^Macrophages_LPS 100 
ng/ml 


35.1 


64028-1 Thymus none 


18.7 


93098_HUVEC 
(Endotiielial) none 


28.1 


64030-1 Kidney none 


52.1 


93099_HUVEC 
(Endothelial) starved 


37.1 







Table 55. Panel CNS_neurodegeneration_v1.0 

Tissue Name                       Relative Expression(%)    Tissue Name                       Relative Expression(%) 
                                  tm6902t_ag2431_b1s2                                         tm6902t_ag2431_b1s2 
AD 1 Hippo                        9.1                       Control (Path) 3 Temporal Ctx     2.0 
AD 2 Hippo                        0.0                       Control (Path) 4 Temporal Ctx     0.0 
AD 3 Hippo                        0.0                       AD 1 Occipital Ctx                2.0 
AD 4 Hippo                        0.0                       AD 2 Occipital Ctx (Missing)      0.0 
AD 5 Hippo                        61.4                      AD 3 Occipital Ctx                3.6 
AD 6 Hippo                        6.2                       AD 4 Occipital Ctx                0.0 
Control 2 Hippo                   0.0                       AD 5 Occipital Ctx                4.1 
Control 4 Hippo                   0.0                       AD 6 Occipital Ctx                34.0 
Control (Path) 3 Hippo            0.0                       Control 1 Occipital Ctx           0.0 
AD 1 Temporal Ctx                 2.9                       Control 2 Occipital Ctx           4.5 
AD 2 Temporal Ctx                 0.0                       Control 3 Occipital Ctx           3.4 
AD 3 Temporal Ctx                 0.0                       Control 4 Occipital Ctx           4.1 
AD 4 Temporal Ctx                 0.0                       Control (Path) 1 Occipital Ctx    5.7 
AD 5 Inf Temporal Ctx             52.5                      Control (Path) 2 Occipital Ctx    2.1 
AD 5 Sup Temporal Ctx             100.0                     Control (Path) 3 Occipital Ctx    0.0 
AD 6 Inf Temporal Ctx             6.6                       Control (Path) 4 Occipital Ctx    9.5 
AD 6 Sup Temporal Ctx             16.6                      Control 1 Parietal Ctx            0.0 
Control 1 Temporal Ctx            0.0                       Control 2 Parietal Ctx            61.0 
Control 2 Temporal Ctx            1.7                       Control 3 Parietal Ctx            1.3 
Control 3 Temporal Ctx            1.3                       Control (Path) 1 Parietal Ctx     13.6 
Control 4 Temporal Ctx            0.0                       Control (Path) 2 Parietal Ctx     0.0 
Control (Path) 1 Temporal Ctx     6.6                       Control (Path) 3 Parietal Ctx     3.1 
Control (Path) 2 Temporal Ctx     7.4                       Control (Path) 4 Parietal Ctx     1.4 



Panel 1.3D Summary Ag2431 The NOV11 gene, a homolog of thymosin beta 10, is 
most highly expressed in the hippocampus (CT=24.2) and is expressed widely in the brain. 
This gene appears to be important in the process of gliosis, which is a hallmark of all of the 
neurodegenerative diseases. Furthermore, the formation of a glial scar is one of the principal 
barriers to neuroregeneration in response to spinal cord injury and head trauma. Therefore, the 
selective down-regulation of this gene and/or its protein product may be beneficial in the 
treatment of spine or head injury, or in any of the neurodegenerative diseases (Alzheimer's, 
Parkinson's, Huntington's, spinocerebellar ataxia, etc). 

The NOV11 gene also has moderate to low expression in many metabolic tissues 
including adipose, adrenal, adult and fetal heart, adult and fetal liver, adult and fetal skeletal 
muscle, pancreas, pituitary and thyroid. The gene appears to be expressed at higher levels in 
fetal skeletal muscle (CT=28) than in adult skeletal muscle (CT=31.7) and could potentially be 
used to distinguish between the adult and fetal phenotypes of this tissue. In addition, the 
greater expression in fetal skeletal muscle suggests that the NOV11 gene may play a role in 
muscular growth or development in the fetus and therefore could act in a regenerative capacity 
in an adult. Thus, therapeutic modulation of the NOV11 gene could be useful in the treatment 
of muscle related diseases and treatment with the protein product could restore muscle mass or 
function to weak or dystrophic muscle. 

The NOV11 gene is expressed at significant levels in cell lines derived from breast 
cancer, liver cancer and melanoma when compared to expression in the corresponding normal 
tissues. Thus, the expression of this gene could be useful as a marker or as a therapeutic for 
breast and liver cancer, as well as melanomas. In addition, therapeutic modulation of the 
activity of the protein encoded by the NOV11 gene, through the use of peptides, antibodies, 
chimeric molecules or small molecule drugs, may be useful in the therapy of these cancers. 

Panel 2D Summary Ag2431 Highest expression of the NOV11 gene is seen in a 
gastric cancer sample (CT=23.5). The expression of this gene in panel 2D shows an 
association with samples derived from ovarian, bladder, liver, breast, kidney and colon cancers 
when compared to the matched normal tissue. A lung cancer that has metastasized to muscle 
also shows increased expression of this gene when compared to the adjacent muscle tissue. 
Thus, expression of the NOV11 gene could be of use as a marker for these cancers. 
Furthermore, therapeutic modulation of the activity of the product of this gene, through the use 
of peptides, antibodies, chimeric molecules or small molecule drugs, may be beneficial in the 
treatment of these cancers. 
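
The tumor versus matched-normal comparison described above can be made concrete with a few of the legible pairs from Table 53. The following is a minimal sketch, assuming that relative expression percentages from the same panel run can be compared as simple ratios; the pair labels and function name are illustrative, and only pairs whose values are clearly readable are included.

```python
# (label, tumor relative expression %, matched normal adjacent tissue (NAT) %)
# Values are taken from Table 53 (Panel 2D, Ag2431).
PAIRS = [
    ("Kidney Ca 83788 vs NAT 83789", 6.6, 1.4),
    ("Kidney Ca 83790 vs NAT 83791", 6.9, 3.9),
    ("Kidney Ca 83792 vs NAT 83793", 18.2, 3.6),
    ("Lung met to muscle vs muscle NAT", 33.4, 3.0),
]


def tumor_to_nat_ratio(tumor_pct: float, nat_pct: float) -> float:
    """Ratio of tumor to matched normal relative expression."""
    if nat_pct <= 0:
        raise ValueError("NAT expression must be positive to form a ratio")
    return tumor_pct / nat_pct


for label, tumor, nat in PAIRS:
    print(f"{label}: {tumor_to_nat_ratio(tumor, nat):.1f}-fold")
```

In each of these pairs the tumor sample exceeds its matched normal tissue, which is the pattern the summary relies on; a ratio near 1 would indicate no tumor-associated difference for that pair.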

Panel 4D Summary Ag2431 The NOV11 gene is ubiquitously expressed throughout 
this panel in both normal cell types and cell lines, regardless of their activation status. This 
gene encodes a protein that has homology with thymosin beta-10. Some reports indicate that 
thymosin beta 10 (like thymosin beta 4, which is functionally very similar) is an effective 
regulator of a large subset of actin filaments in living cells. Reduced expression of thymosin 
beta-10 may contribute to the senescent phenotype by reducing EC plasticity and thus 
impairing their response to migratory stimuli. Therefore, therapeutics designed with the 
protein encoded by the NOV11 gene may play a role in maintaining or restoring normal 
function of lymphoid, lung, dermal fibroblast and endothelial cells and could be beneficial in 
preventing aging of the cells. 

Panel CNS_neurodegeneration_v1.0 Summary Ag2431 Expression of the 
NOV11 gene is restricted to a few samples in this panel, with highest expression in the 
cerebral cortex of an Alzheimer's patient (CT=33.5). While no association between the 
expression of this gene and the presence of Alzheimer's disease is detected in this panel, 
these results confirm the expression of this gene in the brains of a further set of 
individuals. Please see Panel 1.3D for a discussion of potential utility of this gene in the 
central nervous system (Carpintero et al., Expression of the thymosin beta10 gene in 
normal and kainic acid-treated rat forebrain. Brain Res Mol Brain Res. 70:141-6, 1999). 
OTHER EMBODIMENTS 

Although particular embodiments have been disclosed herein in detail, this has been 
done by way of example for purposes of illustration only, and is not intended to be limiting 
with respect to the scope of the appended claims, which follow. In particular, it is 
contemplated by the inventors that various substitutions, alterations, and modifications may be 
made to the invention without departing from the spirit and scope of the invention as defined 
by the claims. The choice of nucleic acid starting material, clone of interest, or library type is 
believed to be a matter of routine for a person of ordinary skill in the art with knowledge of the 
embodiments described herein. Other aspects, advantages, and modifications are considered to be 
within the scope of the following claims. 
WHAT IS CLAIMED IS: 

1. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

(a) a mature form of an amino acid sequence selected from the group 
consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 
28, 30, 32 and/or 34; 

(b) a variant of a mature form of an amino acid sequence selected from the 
group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 
24, 26, 28, 30, 32 and/or 34, wherein one or more amino acid residues 
in said variant differs from the amino acid sequence of said mature 
form, provided that said variant differs in no more than 15% of the 
amino acid residues from the amino acid sequence of said mature form; 

(c) an amino acid sequence selected from the group consisting of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and/or 34; 
and 

(d) a variant of an amino acid sequence selected from the group consisting 
of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 
and/or 34 wherein one or more amino acid residues in said variant 
differs from the amino acid sequence of said mature form, provided that 
said variant differs in no more than 15% of amino acid residues from 
said amino acid sequence. 

2. The polypeptide of claim 1, wherein said polypeptide comprises the amino acid 
sequence of a naturally-occurring allelic variant of an amino acid sequence selected 
from the group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 
28, 30, 32 and/or 34. 

3. The polypeptide of claim 2, wherein said allelic variant comprises an amino acid 
sequence that is the translation of a nucleic acid sequence differing by a single 
nucleotide from a nucleic acid sequence selected from the group consisting of SEQ ID 
NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and/or 33. 

4. The polypeptide of claim 1, wherein the amino acid sequence of said variant comprises 
a conservative amino acid substitution. 

5. An isolated nucleic acid molecule comprising a nucleic acid sequence encoding a 
polypeptide comprising an amino acid sequence selected from the group consisting of: 

(a) a mature form of an amino acid sequence selected from the group 
consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 
28, 30, 32 and/or 34;

(b) a variant of a mature form of an amino acid sequence selected from the
group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22,
24, 26, 28, 30, 32 and/or 34, wherein one or more amino acid residues
in said variant differs from the amino acid sequence of said mature
form, provided that said variant differs in no more than 15% of the
amino acid residues from the amino acid sequence of said mature form; 

(c) an amino acid sequence selected from the group consisting of SEQ ID
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 and/or 34; 

(d) a variant of an amino acid sequence selected from the group consisting 
of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 
and/or 34, wherein one or more amino acid residues in said variant 
differs from the amino acid sequence of said mature form, provided that 
said variant differs in no more than 15% of amino acid residues from
said amino acid sequence; 

(e) a nucleic acid fragment encoding at least a portion of a polypeptide
comprising an amino acid sequence chosen from the group consisting of
SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32
and/or 34, or a variant of said polypeptide, wherein one or more amino
acid residues in said variant differs from the amino acid sequence of
said mature form, provided that said variant differs in no more than 15%
of amino acid residues from said amino acid sequence; and

(f) a nucleic acid molecule comprising the complement of (a), (b), (c), (d)
or (e).

6. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule comprises the
nucleotide sequence of a naturally-occurring allelic nucleic acid variant. 

7. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule encodes a 
polypeptide comprising the amino acid sequence of a naturally-occurring polypeptide 
variant. 

8. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule differs by a 
single nucleotide from a nucleic acid sequence selected from the group consisting of
SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and/or 33.

9. The nucleic acid molecule of claim 5, wherein said nucleic acid molecule comprises a 
nucleotide sequence selected from the group consisting of 

(a) a nucleotide sequence selected from the group consisting of SEQ ID 
NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and/or 33;

(b) a nucleotide sequence differing by one or more nucleotides from a 
nucleotide sequence selected from the group consisting of SEQ ID 
NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and/or 33,
provided that no more than 20% of the nucleotides differ from said
nucleotide sequence;

(c) a nucleic acid fragment of (a); and 

(d) a nucleic acid fragment of (b).

10. The nucleic acid molecule of claim 5, wherein said nucleic acid molecule hybridizes 
under stringent conditions to a nucleotide sequence chosen from the group consisting 
of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 and/or 33, or a
complement of said nucleotide sequence.

11. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule comprises a 
nucleotide sequence selected from the group consisting of

(a) a first nucleotide sequence comprising a coding sequence differing by 
one or more nucleotide sequences from a coding sequence encoding 
said amino acid sequence, provided that no more than 20% of the
nucleotides in the coding sequence in said first nucleotide sequence
differ from said coding sequence;

(b) an isolated second polynucleotide that is a complement of the first
polynucleotide; and

(c) a nucleic acid fragment of (a) or (b).

12. A vector comprising the nucleic acid molecule of claim 11.

13. The vector of claim 12, further comprising a promoter operably-linked to said nucleic 
acid molecule. 

14. A cell comprising the vector of claim 12. 

15. An antibody that immunospecifically-binds to the polypeptide of claim 1. 

16. The antibody of claim 15, wherein said antibody is a monoclonal antibody. 

17. The antibody of claim 15, wherein the antibody is a humanized antibody.

18. A method for determining the presence or amount of the polypeptide of claim 1 in a
sample, the method comprising: 

(a) providing the sample; 

(b) contacting the sample with an antibody that binds immunospecifically 
to the polypeptide; and

(c) determining the presence or amount of antibody bound to said
polypeptide,

thereby determining the presence or amount of polypeptide in said sample.

19. A method for determining the presence or amount of the nucleic acid molecule of
claim 5 in a sample, the method comprising:

(a) providing the sample; 

(b) contacting the sample with a probe that binds to said nucleic acid 
molecule; and

(c) determining the presence or amount of the probe bound to said nucleic
acid molecule, 

thereby determining the presence or amount of the nucleic acid molecule in said 
sample. 

20. A method of identifying an agent that binds to a polypeptide of claim 1, the method 
comprising: 

(a) contacting said polypeptide with said agent; and 

(b) determining whether said agent binds to said polypeptide. 

21. A method for identifying an agent that modulates the expression or activity of the
polypeptide of claim 1, the method comprising:

(a) providing a cell expressing said polypeptide;

(b) contacting the cell with said agent; and

(c) determining whether the agent modulates expression or activity of said
polypeptide,

whereby an alteration in expression or activity of said peptide indicates said agent
modulates expression or activity of said polypeptide.

22. A method for modulating the activity of the polypeptide of claim 1, the method
comprising contacting a cell sample expressing the polypeptide of said claim with a
compound that binds to said polypeptide in an amount sufficient to modulate the
activity of the polypeptide.

23. A method of treating or preventing a NOVX-associated disorder, said method
comprising administering to a subject in which such treatment or prevention is desired
the polypeptide of claim 1 in an amount sufficient to treat or prevent said NOVX-
associated disorder in said subject.

24. The method of claim 23, wherein said subject is a human.

25. A method of treating or preventing a NOVX-associated disorder, said method
comprising administering to a subject in which such treatment or prevention is desired
the nucleic acid of claim 5 in an amount sufficient to treat or prevent said NOVX-
associated disorder in said subject.

26. The method of claim 25, wherein said subject is a human. 

27. A method of treating or preventing a NOVX-associated disorder, said method 
comprising administering to a subject in which such treatment or prevention is desired 
the antibody of claim 15 in an amount sufficient to treat or prevent said NOVX- 
associated disorder in said subject. 

28. The method of claim 27, wherein the subject is a human. 

29. A pharmaceutical composition comprising the polypeptide of claim 1 and a
pharmaceutically-acceptable carrier.

30. A pharmaceutical composition comprising the nucleic acid molecule of claim 5 and a 
pharmaceutically-acceptable carrier. 

31. A pharmaceutical composition comprising the antibody of claim 15 and a
pharmaceutically-acceptable carrier. 

32. A kit comprising in one or more containers, the pharmaceutical composition of claim
29.

33. A kit comprising in one or more containers, the pharmaceutical composition of claim 
30. 

34. A kit comprising in one or more containers, the pharmaceutical composition of claim
31.

35. The use of a therapeutic in the manufacture of a medicament for treating a syndrome 
associated with a human disease, the disease selected from a NOVX-associated
disorder, wherein said therapeutic is selected from the group consisting of a NOVX
polypeptide, a NOVX nucleic acid, and a NOVX antibody.

36. A method for screening for a modulator of activity or of latency or predisposition to a
NOVX-associated disorder, said method comprising:

(a) administering a test compound to a test animal at increased risk for a
NOVX-associated disorder, wherein said test animal recombinantly
expresses the polypeptide of claim 1;

(b) measuring the activity of said polypeptide in said test animal after
administering the compound of step (a);

(c) comparing the activity of said protein in said test animal with the
activity of said polypeptide in a control animal not administered said
polypeptide, wherein a change in the activity of said polypeptide in said
test animal relative to said control animal indicates the test compound is
a modulator of latency of or predisposition to a NOVX-associated
disorder.

37. The method of claim 36, wherein said test animal is a recombinant test animal that 
expresses a test protein transgene or expresses said transgene under the control of a 
promoter at an increased level relative to a wild-type test animal, and wherein said
promoter is not the native gene promoter of said transgene. 

38. A method for determining the presence of or predisposition to a disease associated with 
altered levels of the polypeptide of claim 1 in a first mammalian subject, the method
comprising: 

(a) measuring the level of expression of the polypeptide in a sample from
the first mammalian subject; and

(b) comparing the amount of said polypeptide in the sample of step (a) to
the amount of the polypeptide present in a control sample from a second
mammalian subject known not to have, or not to be predisposed to, said
disease,

wherein an alteration in the expression level of the polypeptide in the first subject as
compared to the control sample indicates the presence of or predisposition to said
disease.

39. A method for determining the presence of or predisposition to a disease associated with
altered levels of the nucleic acid molecule of claim 5 in a first mammalian subject, the
method comprising: 

(a) measuring the amount of the nucleic acid in a sample from the first 
mammalian subject; and 

(b) comparing the amount of said nucleic acid in the sample of step (a) to 
the amount of the nucleic acid present in a control sample from a second
mammalian subject known not to have, or not to be predisposed to, the
disease;

wherein an alteration in the level of the nucleic acid in the first subject as compared to 
the control sample indicates the presence of or predisposition to the disease. 

40. A method of treating a pathological state in a mammal, the method comprising
administering to the mammal a polypeptide in an amount that is sufficient to alleviate
the pathological state, wherein the polypeptide is a polypeptide having an amino acid
sequence at least 95% identical to a polypeptide comprising an amino acid sequence of 
at least one of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32 
and/or 34, or a biologically active fragment thereof. 

41. A method of treating a pathological state in a mammal, the method comprising
administering to the mammal the antibody of claim 15 in an amount sufficient to
alleviate the pathological state.